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STREPTOCOCCUS PNEUMONIAE PROTEINS AND NUCLEIC ACID MOLECULES 

The present invention relates to proteins derived from Streptococcus pneumoniae, 
nucleic acid molecules encoding such proteins, the use of the nucleic acid and/or 
proteins as antigens/immunogens and in detection/diagnosis, as well as methods for 
5 screening the proteins/nucleic acid sequences as potential anti-microbial targets. 

Streptococcus pneumoniae, commonly referred to as the pneumococcus, is an 
important pathogenic organism. The continuing significance of Streptoccocus 
pneumoniae infections in relation to human disease in developing and developed 

10 countries has been authoritatively reviewed (Fiber, G.R., Science, 265: 1385-1387 
(1994)). That indicates that on a global scale this organism is believed to be the 
most common bacterial cause of acute respiratory infections, and is estimated to 
result in 1 million childhood deaths each year, mostly in developing countries 
(Stansfield, S.K., Pediatr. Infect. Dis., 6: 622 (1987)). In the USA it has been 

15 suggested (Breiman et al. Arch. Intern. Med., 150: 1401 (1990)) that the 
pneumococcus is still the most common cause of bacterial pneumonia, and that 
disease rates are particularly high in young children, in the elderly, and in patients 
with j predisposing conditions such as asplenia, heart, lung and kidney disease, 
diabetes, alcoholism, or with immunosupressive disorders, especially AIDS. These 

20 groups are at higher risk of pneumococcal septicaemia and hence meningitis and 
therefore have a greater risk of dying from pneumococcal infection. The 
pneumococcus is also the leading cause of otitis media and sinusitis, which remain 
prevalent infections in children in developed countries, and which incur substantial 
costs . 

25 

The need for effective preventative strategies against pneumococcal infection is 
highlighted by the recent emergence of penicillin-resistant pneumococci. It has been 
reported that 6.6% of pneumoccal isolates in 13 US hospitals in 12 states were found 
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to be resistant to penicillin and some isolates were also resistant to other antibiotics 
including third generation cyclosporins (Schappert, S.M., Vital and Health Statistics 
of the Centres for Disease Control/National Centre for Health Statistics^ 214:1 
(1992)). The rates of penicillin resistance can be higher (up to 20%) in some 
hospitals (Breiman et at, J. Am. Med. Assoc., 271: 1831 (1994)). Since the 
development of penicillin resistance among pneumococci is both recent and sudden, 
coming after decades during which penicillin remained an effective treatment, these 
findings are regarded as alarming. 

For the reasons given above, there are therefore compelling grounds for considering 
improvements in the means of preventing, controlling, diagnosing or treating 
pneumococcal diseases. 

Various approaches have been taken in order to provide vaccines for the prevention 
of pneumococcal infections . Difficulties arise for instance in view of the variety of 
serotypes (at least 90) based on the structure of the polysaccharide capsule 
surrounding the organism. Vaccines against individual serotypes are not effective 
against other serotypes and this means that vaccines must include polysaccharide 
antigens from a whole range of serotypes in order to be effective in a majority of 
cases . An additional problem arises because it ahs been found that the capsular 
polysaccharides (each of which determines the serotype and is the major protective 
antigen) when purified and used as a vaccine do not reliably induce protective 
antibody responses in children under two years of age, the age group which suffers 
the highest incidence of invasive pnetimococcal infection and meningitis. 

A modification of the approach using capsule antigens relies on conjugating the 
polysaccharide to a protein in order to derive an enhanced immune response, 
particularly by giving the response T-cell dependent character. This approach has 
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been used in the development of a vaccine against Haemophilus influenzae. There 
are issues of cost concerning both the multi-polysaccharide vaccines and those based 
on conjugates. 

5 

A third approach is to look for other antigenic components which offer the potential 
to be vaccine candidates. In the present application we provide a group of proteins 
antigens which are secreted/exported proteins . 

10 Thus, in a first aspect the present invention provides a Streptococcus pneumoniae 
protein or polypeptide having a sequence selected from those shown in table 2 
herein. 

A protein or polypeptide of the present invention may be provided in substantially pure 
15 form. For example, it may be provided in a form which is substantially free of other 
proteins. 

In a preferred embodiment, a protein or polypeptide having an amino acid sequence as 
shown in Table 3 is provided. 

20 

The invention encompasses any protein coded for by a nucleic acid sequence as shown 
in Table 1 herein. 

As discussed herein, the proteins and polypeptides of the invention are useful as 
25 antigenic material. Such material can be "antigenic" and/or "immunogenic". 
Generally, "antigenic" is taken to mean that the protein or polypeptide is capable of 
being used to raise antibodies or indeed is capable of inducing an antibody response in 
a subject. "Immunogenic" is taken to mean that the protein or polypeptide is capable of 
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eliciting a protective immune response in a subject. Thus, in the latter case, the protein 
or polypeptide may be capable of not only generating an antibody response and in 
addition non-antibody based immune responses. 



The skilled person will appreciate that homologues or derivatives of the proteins or 
10 polypeptides of the invention will also find use in the context of the present invention, 
ie as antigenic/immunogenic material. Thus, for instance proteins or polypeptides 
which include one or more additions, deletions, substitutions or the like are 
encompassed by the present invention. In addition, it may be possible to replace one 
amino acid with another of similar "type". For instance replacing one hydrophobic 
15 amino acid with another. One can use a program such as the CLUSTAL program to 
compare amino acid sequences. This program compares amino acid sequences and 
finds the optimal alignment by inserting spaces in either sequence as appropriate. It is 
possible to calculate amino acid identity or similarity (identity plus conservation of 
amino acid type) for an optimal alignment, A program like BLASTx will align the 
20 longest stretch of similar sequences and assign a value to the fit. It is thus possible to 
obtain a comparison where several regions of similarity are found, each having a 
different score. Both types of analysis are contemplated in the present invention. 

In the case of homologues and derivatives, the degree of identity with a protein or 
25 polypeptide as described herein is less important than that the homologue or derivative 
should retain its antigenicity or immunogenicity to streptoccocus pneumoniae. 
However, suitably, homologues or derivatives having at least 60% similarity (as 
discussed above) with the proteins or polypeptides described herein are provided. 
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Preferably, homologues or derivatives having at least 70% similarity, more preferably 
at least 80% similarity are provided. Most preferably, homologues or derivatives 
having at least 90% or even 95% similarity are provided. 

5 In an alternative approach, the homologues or derivatives could be fusion proteins, 
incorporating moieties which render purification easier, for example by effectively 
tagging the desired protein or polypeptide. It may be necessary to remove the "tag" or 
it may be the case that the fusion protein itself retains sufficient antigenicity to be 
useful. 

10 

In an additional aspect of the invention there are provided antigenic fragments of the 
proteins or polypeptides of the invention, or of homologues or derivatives thereof. 

For fragments of the proteins or polypeptides described herein, or of homologues or 
15 derivatives thereof, the situation is slightly different. It is well known that is possible to 
screen an antigenic protein or polypeptide to identify epitopic regions, ie those regions 
which are responsible for the protein or polypeptide's antigenicity or immunogenicity . 
Methods for carrying out such screening are well known in the art. Thus, the fragments 
of the present invention should include one or more such epitopic regions or be 
20 sufficiently similar to such regions to retain their antigenic/inGmunogenic properties. 
Thus, for fragments according to the present invention the degree of identity is perhaps 
irrelevant, since they may be 100% identical to a particular part of a protein or 
polypeptide, homologue or derivative as described herein. The key issue, once again, is 
that the fragment retains the antigenic/immunogenic properties. 

25 

Thus, what is important for homologues, derivatives and fragments is that they possess 
at least a degree of the antigenicity/immunogenicity of the protein or polypeptide from 
which they are derived. 
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Gene cloning techniques may be used to provide a protein of the invention in 
substantially pure form. These techniques are disclosed, for example, in J. Sambrook 
et al Molecular Cloning 2nd Edition, Cold Spring Harbor Laboratory Press (1989). 
5 Thus, in a fourth aspect, the present invention provides a nucleic acid molecule 
comprising or consisting of a sequence which is: 



(i) any of the DNA sequences set out in Table 1 or their RNA equivalents; 



10 (ii) a sequence which is complementary to any of the sequences of (i); 



(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 



15 (iv) a sequence which is has substantial identity with any of those of (i), (ii) 

and (iii); 



(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 1 . 

20 

In a fifth aspect the present invention provides a nucleic acid molecule comprising or 
consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 

25 

(ii) a sequence which is complementary to any of the sequences of (i); 
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(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which is has substantial identity with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a protein 
as defined in Table 4. 

The nucleic acid molecules of the invention may include a plurality of such sequences, 
and/or fragments. The skilled person will appreciate that the present invention can 
include novel variants of those particular novel nucleic acid molecules which are 
exemplified herein, „ ..Such variants are encompassed by the present invention. These 
may occur in nature, for example because of strain variation. For example, additions, 
substitutions and/or deletions are included. In addition, and particularly when utilising 
microbial expression systems, one may wish to engineer the nucleic acid sequence by 
making use of known preferred codon usage in the particular organism being used for 
expression. Thus, synthetic or non-naturally occurring variants are also included within 
the scope of the invention. 

The term "RNA equivalent" when used above indicates that a given RNA molecule has 
a sequence which is complementary to that of a given DNA molecule (allowing for the 
fact that in RNA "U" replaces "T" in the genetic code). 

When comparing nucleic acid sequences for the purposes of determining the degree of 
homology or identity one can use programs such as BESTFIT and GAP (both from the 
Wisconsin Genetics Computer Group (GCG) software package) BESTFIT, for 
example, compares two sequences and produces an optimal alignment of the most 
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similar segments. GAP enables sequences to be aligned along their whole length and 
finds the optimal alignment by insertmg spaces in either sequence as appropriate. 
Suitably, in the context of the present invention compare when discussing identity of 
nucleic acid sequences, the comparison is made by alignment of the sequences along 
5 their whole length. 

Preferably, sequences which have substantial identity have at least 50% sequence 
identity, desirably at least 75% sequence identity and more desirably at least 90 or at 
least 95% sequence identity with said sequences. In some cases the sequence identity 
10 may be 99% or above. 

Desirably, the term "substantial identity" indicates that said sequence has a greater 
degree of identity with any of the sequences described herein than with prior art nucleic 
acid sequences. 

15 

It should however be noted that where a nucleic acid sequence of the present invention 
codes for at least part of a novel gene product the present invention includes within its 
scope all possible sequence coding for the gene product or for a novel part thereof. 

20 The nucleic acid molecule may be in isolated or recombinant form. It may be 
incorporated into a vector and the vector may be incorporated into a host. Such vectors 
and suitable hosts form yet further aspects of the present invention. 

Therefore, for example, by using probes based upon the nucleic acid sequences 
25 provided herein, genes in Streptococcus pneumoniae can be identified. They can then 
be excised using restriction enzymes and cloned into a vector. The vector can be 
introduced into a suitable host for expression. 
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Nucleic acid molecules of the present invention may be obtained from S,pneumoniae by 
the use of appropriate probes complementary to part of the sequences of the nucleic 
acid molecules. Restriction enzymes or sonication techniques can be used to obtain 
appropriately sized fragments for probing. 

5 

Alternatively PGR techniques may be used to amplify a desired nucleic acid sequence. 
Thus the sequence data provided herein can be used to design two primers for use in 
PGR so that a desired sequence, including whole genes or fragments thereof, can be 
targeted and then amplified to a high degree. One primer will normally show a high 
10 degree of specificity for a first sequence located on one strand of a DNA molecule, and 
the other primer will normally show a high degree of specificity for a second sequence 
located on the complementary strand of the DNA sequence and being spaced from the 
complementary sequence to the first sequence. 

15 Typically primers will be at least 15-25 nucleotides long. 

As a further alternative chemical synthesis may be used. This may be automated. 
Relatively short sequences may be chemically synthesised and ligated together to 
provide a longer sequence. 

20 

In yet a further aspect the present invention provides an immunogenic/antigenic 
composition comprising one or more proteins or polypeptides selected from those 
whose sequences are shown in Tables 2-4, or homologues or derivatives thereof, 
and/or fragments of any of these. In preferred embodiments, the 
25 immunogenic/ antigenic composition is a vaccine or is for use in a diagnostic assay. 

In the case of vaccines suitable additional excipients, diluents, adjuvants or the like 
may be included. Numerous examples of these are well known in the art. 
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It is also possible to utilise the nucleic acid sequences shown in Table 1 in the 
preparation of so-called DNA vaccines. Thus, the invention also provides a vaccine 
composition comprising one or more nucleic acid sequences as defined herein. The 
5 use of such DNA vaccines is described in the art. See for instance, Donnelly et al , 
Ann, Rev. Immunol, 15:617-648 (1997). 

As already discussed herein the proteins or polypeptides described herein, their 
homologues or derivatives, and/or fragments of any of these, can be used in methods 

10 of detecting/diagnosing S. pneumoniae. Such methods can be based on the detection 
of antibodies against such proteins which may be present in a subject. Therefore the 
present invention provides a method for the detection/diagnosis of S.pneumoniae 
which comprises the step of bringing into contact a sample to be tested with at least 
one protein, or homologue, derivative or fragment thereof, as described herein. 

15 Suitably, the sample is a biological sample, such as a tissue sample or a sample of 
blood or saliva obtained from a subject to be tested. 

In an alternative approach, the proteins described herein, or homologues, derivatives 
and/or fragments thereof, can be used to raise antibodies, which in turn can be used 
20 to detect the antigens, and hence S.pneumoniae. Such antibodies form another aspect 
of the invention. Antibodies within the scope of the present invention may be 
monoclonal or polyclonal. 

Polyclonal antibodies can be raised by stimulating their production in a suitable animal 
25 host (e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a protein as 
described herein, or a homologue, derivative or fragment thereof, is injected into the 
animal. If desired, an adjuvant may be administered together with the protein. Well- 
known adjuvants include Freund's adjuvant (complete and incomplete) and aluminium 



wo 00/06737 PCT/GB99/02451 

11 

hydroxide. The antibodies can then be purified by virtue of their binding to a protein as 
described herein. 

Monoclonal antibodies can be produced from hybridomas. These can be formed by 
5 fusing myeloma cells and spleen cells which produce the desired antibody in order to 
form an immortal cell line. Thus the well-known Kohler & Milstein technique {Nature 
256 (1975)) or subsequent variations upon this technique can be used. 

Techniques for producing monoclonal and polyclonal antibodies that bind to a 
10 particular polypeptide/protein are now well developed in the art. They are discussed in 
standard immunology textbooks, for example in Roitt et al. Immunology second edition 
(1989), Churchill Livingstone, London. 

In addition to whole antibodies, the present invention includes derivatives thereof which 
15 are capable of binding to proteins etc as described herein. Thus the present invention 
includes antibody fragments and synthetic constructs. Examples of antibody fragments 
and synthetic constructs are given by Dougall et al in Tibtech 12 372-379 (September 
1994). 

20 Antibody fragments include, for example. Fab, F(ab')2 and Fv fragments. Fab 
fragments (These are discussed in Roitt et al \supra^ ). Fv fragments can be modified 
to produce a synthetic construct known as a single chain Fv (scFv) molecule. This 
includes a peptide linker covalently joining Vu and regions, which contributes to the 
stability of the molecule. Other synthetic constructs that can be used include CDR 

25 peptides. These are synthetic peptides comprising antigen-binding determinants. 
Peptide mimetics may also be used. These molecules are usually conformationally 
restricted organic rings that mimic the structure of a CDR loop and that include 
antigen-interactive side chains. 
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Synthetic constructs include chimaeric molecules. Thus, for exanaple, humanised (or 
primatised) antibodies or derivatives thereof are within the scope of the present 
invention. An example of a humanised antibody is an antibody having human 
5 framework regions, but rodent hypervariable regions. Ways of producing chimaeric 
antibodies are discussed for example by Morrison et al in PNAS, 81, 6851-6855 (1984) 
and by Takeda et al in Nature. 314, 452-454 (1985). 

Synthetic constructs also include molecules comprising an additional moiety that 
10 provides the molecule with some desirable property in addition to antigen binding. For 
example the moiety may be a label (e.g. a fluorescent or radioactive label). 
Alternatively, it may be a pharmaceutically active agent. 

Antibodies, or derivatives thereof, find use in detection/diagnosis of S, pneumoniae, 
15 Thus, in another aspect the present invention provides a method for the 
detection/diagnosis of S.pneumoniae which comprises the step of bringing into contact 
a sample to be tested and antibodies capable of binding to one or more proteins 
described herein, or to homologues, derivatives and/or fragments thereof. 

20 In addition, so-called "Affibodies" may be utilised. These are binding proteins 
selected from combinatorial libraries of an alpha-helical bacterial receptor domain 
(Nord et al ,) Thus, Small protein domains, capable of specific binding to different 
target proteins can be selected using combinatorial approaches . 

25 

It will also be clear that the nucleic acid sequences described herein may be used to 
detect/diagnose S, pneumoniae. Thus, in yet a further aspect, the present invention 
provides a method for the detection/diagnosis of S.pneumoniae which comprises the 
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step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as described herein. Suitably, the sample is a biological sample, such as a 
tissue sample or a sample of blood or saliva obtained from a subject to be tested. 
Such samples may be pre-treated before being used in the methods of the invention. 
5 Trhus, for example, a sample may be treated to extract DNA. Then, DNA probes 
based on the nucleic acid sequences described herein (ie usually fragments of such 
sequences) may be used to detect nucleic acid from S.pneumoniae, 

In additional aspects, the present invention provides: 

10 

(a) a method of vaccinating a subject against S, pneumoniae which comprises the 
step of administering to a subject a protein or polypeptide of the invention, or a 
derivative, homologue or fragment thereof, or an immunogenic composition of the 
invention; 

15 

(b) a method of vaccinating a subject against S. pneumoniae which comprises the 
step of administering to a subject a nucleic acid molecule as defined herein; 

(c) a method for the prophylaxis or treatment of S.pneumoniae infection which 
20 comprises the step of administering to a subject a protein or polypeptide of the 

invention, or a derivative, homologue or fragment thereof, or an immunogenic 
composition of the invention; 

(d) a method for the prophylaxis or treatment of S,pneumoniae infection which 
25 comprises the step of administering to a subject a nucleic acid molecule as defined 

herein; 



(e) a kit for use in detecting/diagnosing S.pneumoniae infection comprising one 
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or more proteins or polypeptides of the invention, or homologues, derivatives or 
fragments thereof, or an antigenic composition of the invention; and 

(f) a kit for use in detecting/diagnosing S, pneumoniae infection comprising one 
5 or more nucleic acid molecules as defined herein. 

Given that we have identified a group of important proteins, such proteins are 
potential targets for anti-microbial therapy. It is necessary, however, to determine 
whether each individual protein is essential for the organism's viability. Thus, the 
10 present invention also provides a method of determining whether a protein or 

polypeptide as described herein represents a potential anti-microbial target which 
comprises inactivating said protein and determining whether S.pneumoniae is still 
viable, in vitro or in vivo, 

15 A suitable method for inactivating the protein is to effect selected gene knockouts, ie 
prevent expression of the protein and determine whether this results in a lethal 
change. Suitable methods for carrying out such gene knockouts are described in Li et 
at, P,N,A.S., 94:13251-13256 (1997). 

20 In a final aspect the present invention provides the use of an agent capable of 

antagonising, inhibiting or otherwise interfering with the function or expression of a 
protein or polypeptide of the invention in the manufacture of a medicament for use in 
the treatment or prophylaxis of S, pneumoniae infection. 

25 The invention will now be described with reference to the following examples, 
which should not be construed as in any way limiting the invention. The examples 
refer to the figures in which: 
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Figure 1: shows the results of various DNA vaccine trials; and 
Figure 2: shows the results of further DNA vaccine trials. 

EXAMPLE 1 

5 

The Genome sequencing of Streptococcus pneumoniae type 4 is in progress at the 

Institute for Genomic Research (TIGR, Rockville, MD, USA). Up to now, the 
whole sequence has not been completed or published. On November 1997, the 

10 TIGR centre released some DNA sequences as contigs which are not accurate 
reflections of the finished sequence. These contigs can be downloaded from their 
Webster (www@tigr.org) . We downloaded these contigs and created a local database 
using the application GCGToBLAST (Wisconsin Package Version 9.1, Genetics 
Computer Group (GCG), Madison, USA). This database can be searched with the 

15 Fast A and TfastA procedures (using the method of Pearson and Lipman (PNAS USA, 
85:2444-2448 (1988)). 

Using Fast A and TfastA procedures, the local pneumococcus database was searched 
for putative leader sequence or anchor sequence features. Relevant sequences were 
20 used to interrogate for comparative novel sequences. These were: 

(i) already described leader sequences of Streptococcus pneumoniae (from 
proteins NanA, NanB, LytA, Pap A, pep A, PsaA and PspA); 

25 (ii) the leader sequence of Usp45, a secreted protein from Lactococcus lactis; 

(iii) new hypothetical leader sequences derived from the searches in (i) and 

(ii) ; 
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(iv) the anchor motif LPxTG, a feature common to many Gram-positive 
bacteria surface proteins which are anchored by a mechanism involving the 
Sortase complex proteins. 

5 

Provided below is an example of this approach, with reference to the sequences 
derived from the database (see table 1). 
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Usp45 



NanA 



PapA 



PsaA 



PspA 



PrtP 






4121.1 4101.3 4172.4 4106.1 4346.1 4194.1 



4172.2 



4147.2 




i 



10 



4191.6 



4137.3 4193.1 4122.1 



15 




4170.8 4175.3 



4187.2 4188.1 4102.1 




4136.1 



4127.1 4191.1 



20 



4119.1 



4119.2 



The protein leader sequences of different known exported proteins were used as a 
starting point for a search of the local pneumococcus database described above. The 
hypothetical proteins found with this search were then submitted to a Blast search in 
25 general databases such as EMBL, Swissprot etc. Proteins remaining unknown in the 
pneumococcus are kept and annotated. Then the search is performed again using the 
new potential protein leader sequence as a probe, using the TfastA procedure. 
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Example 2: DNA vaccine trials 
pcDNA3.1+ as a DNA vaccine vector 
5 pcDNA3.1 + 

The vector chosen for use as a DNA vaccine vector was pcDN A3 . 1 (Invitrogen) 
(actually pcDN A3 . 1 + , the forward orientation was used in all cases but may be 
referred to as pcDN A3 . 1 here on) . This vector has been widely and successfully 

10 employed as a host vector to test vaccine candidate genes to give protection against 
pathogens in the literature (Zhang, et aL, Kurar and Splitter, Anderson et aL). The 
vector was designed for high-level stable and non-replicative transient expression in 
manomalian cells. pcDNA3.1 contains the ColEl origin of replication which allows 
convenient high-copy number replication and growth in E, coli. This in turn allows 

15 rapid and efficient cloning and testing of many genes. The pcDNA3.1 vector has a 
large number of cloning sites and also contains the gene encoding ampicillin 
resistance to aid in cloning selection and the human cytomegalovirus (CMV) 
immediate-early promoter/enhancer which permits efficient, high-level expression of 
the recombinant protein. The CMV promoter is a strong viral promoter in a wide 

20 range of cell types including both muscle and immune (antigen presenting) cells. 

This is important for optimal immune response as it remains unknown as to which 
cells types are most important in generating a protective response in vivo, A T7 
promoter upstream of the multiple cloning site affords efficient expression of the 
modified insert of interest and which allows in vitro transcription of a cloned gene in 

25 the sense orientation. 

Zhang, D., Yang, X., Berry, J. Shen, C, McClarty, G, and Brunham, R.C. (1997) 
"DNA vaccination with the major outer-membrane protein genes induces acquired 
immunity to Chlamydia trachomatis (mouse pneumonitis) infection" . Infection and 
30 Immunity, 176, 1035-40. 

Kurar, E. and Splitter, G.A. (1997) "Nucleic acid vaccination of Brucella abortus 
ribosomal L7/L72 gene elicits immune response". Vaccine, 15, 1851-57. 

35 Anderson, R., Gao, X.-M., Papakonstantinopoulou, A., Roberts, M. and Dougan, 
G. (1996) "Immune response in mice following immunisation with DNA encoding 
fragment C of tetanus toxin". Infection and Immunity, 64, 3168-3173. 

Preparation of DNA vaccines 

40 

Oligonucleotide primers were designed for each individual gene of interest derived 
using the LEEP system. Each gene was examined thoroughly, and where possible, 
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primers were designed such that they targeted that portion of the gene thought to 
encode only the mature portion of the gene protein. It was hoped that expressing 
those sequences that encode only the mature portion of a target gene protein, would 
facilitate its correct folding when expressed in mammalian cells. For example, in the 
5 majority of cases primers were designed such that putative N-terminal signal peptide 
sequences would not be included in the final amplification product to be cloned into 
the pcDNAS.l expression vector. The signal peptide directs the polypeptide 
precursor to the cell membrane via the protein export pathway where it is normally 
cleaved off by signal peptidase I (or signal peptidase II if a lipoprotein). Hence the 
10 signal peptide does not make up any part of the mature protein whether it be 

displayed on the surface of the bacteria surface or secreted. Where a N-terminal 
leader peptide sequence was not immediately obvious, primers were designed to 
target the whole of the gene sequence for cloning and ultimately, expression in 
pcDNAS.l. 

15 

Having said that, however, other additional features of proteins may also affect the 
expression and presentation of a soluble protein. DNA sequences encoding such 
features in the genes encoding the proteins of interest were excluded during the 
20 design of oligonucleotides. These features included: 

1. LPXTG cell wall anchoring motifs. 

2. LXXC ipoprotein attachment sites. 

3. Hydrophobic C-terminal domain. 

25 4. Where no N-terminal signal peptide or LXXC was present the start codon was 
excluded. 

5 . Where no hydrophobic C-terminal domain or LPXTG motif was present the stop 
codon was removed. 

30 Appropriate PCR primers were designed for each gene of interest and any and all of 
the regions encoding the above features was removed from the gene when designing 
these primers. The primers were designed with the appropriate enzyme restriction 
site followed by a conserved Kozak nucleotide sequence (in all cases) GCCACC was 
used. The Kozak sequence facilitates the recognition of initiator sequences by 

35 eukaryotic ribosomes) and an ATG start codon upstream of the insert of the gene of 
interest. For example the forward primer using a BamHl site the primer would 
begm GCGGGATCCGCCACCATG followed by a small section of the 5' end of the 
gene of interest. The reverse primer was designed to be compatible with the forward 
primer and with a Notl restriction site at the 5' end in all cases (this site is 

40 TTGCGGCCGC). 

PCR primers 
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The following PCR primers were designed and used to amplify the truncated genes 
of interest. 

5 

ID210 

Forward Primer 5' CGGATCCGCCACCATGTCTTCTAATGAATCTGCCGATG 
3' 

10 Reverse Primer 5' TTGCGGCCGCCTGTTTAGATTGGATATCTGTAAAGACTT 
3' 

4172.5 

15 Forward Primer 5' 

CGCGGATCCGCCACCATGGATTTTCCTTCAAATTTGGAGG 3 ' 
Reverse Primer 5' TTGCGGCCGCACCGTACTGGCTGCTGACT 3' 

ID211 

20 

Forward Primer 5' 

CGGATCCGCCACC ATGAGTGAGATC AA A ATT ATTA ACGC 3 ' 
Reverse Prhner 5' TTGCGGCCGCCGTTCCATGGTTGACTCCT 3' 

25 4197.4 

Forward Primer 5' CGCGGATCCGCCACCATGTGGGACATATTGGTGGAAAC 
3' 

Reverse Pruner 5' TTGCGGCCGCTTCACTTGAGCAAACTGAATCC 3' 

30 

4122.1 

Forward Primer 5' 

CGCGGATCCGCC ACC ATGTC AC A AGA AA A A AC A A A AAATGAA 3 ' 
35 Reverse Prhner 5 ' TTGCGGCCGC ATCGACGTAGTCTCCGCC 3 ' 

4126.7 

Forward Primer 5' 

40 CGCGGATCCGCC ACC ATGCTGGTTGGAACTTTCTACTATC A AT 3' 
Reverse Prhner 5' TTGCGGCCGC AACTTTCGTCCCTTTTTGG 3' 
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4188.11 

Forward Primer 5' CGCGGATCCGCCACCATGGGCAATTCTGGCGGAA 3' 
Reverse Primer 5' TTGCGGCCGCTTGTTTCATAGCTTTTTTGATTGTT 3' 

5 

ID209 

Forward Primer 5 ' 

CGCGGATCCGCCACCATGCTATTGATACGA A ATGC AGGG 3 " 
10 Reverse Primer 5' TTGCGGCCGCAACATAATCTAGTAAATAAGCGTAGCC 3' 

ID215 

Forward Primer 5' CGCGGATCCGCCACCATGACGGCGACGAATTTTC 3' 
15 Reverse Primer 5' TTGCGGCCGCTTAATTCGTTTTTGAACTAGTTGCT 3' 

4170.4 

Forward Primer 5' 

20 CGCGGATCCGCCACCATGGCTGTTTTTCTTCGCTATCATG 3' 

Reverse Primer 5' TTGCGGCCGCTTTCTTCAACAAACCTTGTTCTTG 3' 

4193.1 

25 Forward Primer 5' 

CGCGGATCCGCCACCATGGGTAACCGCTCTTCTCGTAAC 3 ' 
Reverse Primer 5' TTGCGGCCGCGCTTCCATCAAGGATTTTAGC 3' 

Cloning 

30 

The insert along with the flanking features described above was amplified using PGR 
against a template of genomic DNA isolated from type 4 S. pneumoniae strain 11886 
obtained from the National Collection of Type Cultures. The PCR product was cut 
with the appropriate restriction enzymes and cloned in to the multiple cloning site of 

35 pcDNA3.1 using conventional molecular biological techniques. Suitably mapped 
clones of the genes of interested were cultured and the plasmids isolated on a large 
scale (> 1.5 mg) using Plasmid Mega Kits (Qiagen). Successful cloning and 
maintenance of genes was confirmed by restriction mapping and sequencing ~ 700 
base pairs through the 5 ' cloning junction of each large scale preparation of each 

40 construct. 



Strain validation 
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A strain of type 4 was used in cloning and challenge methods which is the strain 
from which the S. pneumoniae genome was sequenced. A freeze dried ampoule of a 
homogeneous laboratory strain of type 4 5. pneumoniae strain NCTC 11886 was 
5 obtained from the National Collection of Type Strains . The ampoule was opened and 
the cultured re suspended with 0.5 ml of tryptic soy broth (0.5% glucose, 5% 
blood). The suspension was subcultured into 10 ml tryptic soy broth (0.5% glucose, 
5% blood) and incubated statically overnight at 37^C. This culture was streaked on 
to 5 % blood agar plates to check for contaminants and confirm viability and on to 
10 blood agar slopes and the rest of the culture was used to make 20% glycerol stocks. 
The slopes were sent to the Public Health Laboratory Service where the type 4 
serotype was confirmed. 

A glycerol stock of NCTC 11886 was streaked on a 5% blood agar plate and 
15 incubated overnight in a C02 gas jar at 37°C. Fresh streaks were made and optochin 
sensitivity was confirmed. 

Pneumococcal challenge 

20 A standard inoculum of type 4 5. pneumoniae was prepared and frozen down by 

passaging a culture of pneumococcus Ix through mice, harvesting from the blood of 
infected animals, and grown up to a predetermined viable count of around 10^ 
cfu/ml in broth before freezing down. The preparation is set out below as per the 
flow chart. 



25 



30 



35 



Streak pneumococcal culture and confirm identity 



Grow over-night culture from 4-5 colonies on plate above 



V 



Animal passage pneumococcal culture 
(i.p. injection of cardiac bleed to harvest) 



40 V 



Grow over-night culture from animal passaged pneumococcus 
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I 

V 

5 Grow day culture (to pre-determined optical density) from over-night of animal 

passage and freeze down at -70 °C - This is standard minimum 

I 

V 

10 

Thaw one aliquot of standard inoculum to viable count 

V 

15 

Use standard inoculum to determine effective dose (called Virulence Testing) 

I 

V 

20 

All subsequent challenges - use standard inoculum to effective dose 

An aliquot of standard inoculum was diluted 500x in PBS and used to inoculate the 
mice. 

25 

Mice were lightly anaesthetised using halothane and then a dose of 1.4 x 10^ cfu of 
pneumococcus was applied to the nose of each mouse. The uptake was facilitated by 
the normal breathing of the mouse, which was left to recover on its back. 

30 S. pneumoniae vaccine trials 

Vaccine trials in mice were carried out by the administration of DNA to 6 week old 
CBA/ca mice (Harlan, UK). Mice to be vaccinated were divided into groups of six 
and each group was inraiunised with recombinant pcDNAS . 1 + plasmid DNA 

35 containing a specific target-gene sequence of interest. A total of 100 ^tg of DNA in 
Dulbecco*s PBS (Sigma) was injected intramuscularly into the tibialis anterior 
muscle of both legs (50 ftl in each leg) . A boost was carried using the same 
procedure 4 weeks later. For comparison, control groups were included in all 
vaccine trials. These control groups were either unvaccinated animals or those 

40 administered with non-recombinant pcDNAS. 1 + DNA (sham vaccinated) only, 

using the same time course described above. 3 weeks after the second immunisation, 
all mice groups were challenged intra-nasally with a lethal dose of S. pneumoniae 
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serotype 4 (strain NCTC 11886). The number of bacteria administered was 
monitored by plating serial dilutions of the inoculum on 5% blood agar plates. A 
problem with intranasal inmiunisations is that in some mice the inoculum bubbles out 
of the nostrils, this has been noted in results table and taken account of in 
5 calculations. A less obvious problem is that a certain amount of the inoculum for 
each mouse may be swallowed. It is assumed that this amount will be the same for 
each mouse and will average out over the course of innoculations . However, the 
sample sizes that have been used are small and this problem may have significant 
effects in some experiments. All mice remaining after the challenge were killed 3 or 

10 4 days after infection. During the infection process, challenged mice were monitored 
for the development of symptoms associated with the onset of 5. pneumoniae 
induced-disease. Typical symptoms in an appropriate order included piloerection, 
an increasingly hunched posture, discharge from eyes, increased lethargy and 
reluctance to move. The latter symptoms usually coincided with the development of 

15 a moribund state at which stage the mice were culled to prevent further suffering. 

These mice were deemed to be very close to death, and the time of culling was used 
to determine a survival time for statistical analysis. Where mice were found dead, 
the survival time was taken as the last time point when the mouse was monitored 
alive. 

20 

Interpretation of Results 

A positive result was taken as any DNA sequence that was cloned and used in 
challenge experiments as described above which gave protection against that 

25 challenge. Protection was taken as those DNA sequences that gave statistically 

significant protection (to a 95% confidence level (p<0.05)) and also those which were 
marginal or close to significant using Mann- Whitney or which show some protective 
features for example there were one or more outlying mice or because the time to the 
first death was prolonged. It is acceptable to allow marginal or non-significant results 

30 to be considered as potential positives when it is considered that the clarity of some of 
the results may be clouded by the problems associated with the administration of 
intranasal infections. 
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Table 1 



4101.1 

ATGGAAGAGTTAGTGACCTTAGATTGTTTGTTTATTGACAGAACTAAGATTGAAGCCAATGCCAACAAGTATAGT^ 
5 TTGTGTGGAAGAAAACGACAGAGAAATTCrCCGCCAAACTTCAAGAACAGATACAGGTCTATTTTCAAGAAGAAA 
TCACTCCCCnrCTGATTAAATATGCCATGTTTGATAAGAAACAAAAGAGAGGGTATAAAGAGTCAGCTAAAAACT 
TAGCGAATTGGCACTATAATGACAAGGAGGATAGCTACACACATCCTGATGGCTGGTATTATCGTTTTCACCATAC 
CAAATATCAGAAAACACAGACAGACTTTCAACAAGAAATCAAGGTTTACTACGCCGACGAACCrGAATCAGCCCC 
TCAAAAGGGACTGTATATGAACGAACGCrATCAAAACTTGAAAGCTAAAGAATGTCAGGCGCTTT^ 
10 AGGTAGACAGATTTTCGCTCAACGCAAGATTGATGTGGAACCTGTCTTTGGGCAGATAAAGGCTTCTT^ 

AAGAGATGTAATCTGAGAGGGAAGCGTCAAGTGAGAATTGACATGGGATTGGTACTTATGGCCAATAACCTCCTA 
AAATATAGTAAAATGAAATAA 

4101.3 

15 ATGGGGAAAGGCCATTGGAATCGGAAAAGAGTTTATAGCATTCGTAAGTTTGCrOTGGGAGCTTGCTCAGTAATG 
ATTGGGACTTGTGCAGTTTTATTAGGAGGAAATATAGCTGGAGAATCTGTAGTTTATGCGG ATGA AACACITAT^ 
CTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGATAGTAGAAGAAAAGGCTGATAAAGCTTTGGAAACT 
AATATAGTTGAAAGGACAGAACAAAGTGAACCTAGTTCAACTGAGGCTATTGCATCTGAGAAGAAAGAAGATGAA 
GCCGTAACTCCAAAAGAGGAAAAAGTGTCTGCTAAACCGGAAGAAAAAGCTCCAAGGATAGAATCACAAGCT^ 

20 AAATCAAGAAAAACCGCTCAAGGAAGATGCTAAAGCTGTAACAAATGAAGAAGTGAATCAAATGATTGAAGACA 
GGAAAGTGGATTTTAATCAAAATTGGTACITTAAACTCAATGCAAATTCT^ 

ACGTATCTACGTGGAAAAAATTAGATTTACCGTATGACTGGAGTATCTTTAACGATTTCGATCATG^ 

ACAAAATGAAGGTGGACAGCTCAACGGTGGGGAAGCTTGGTATCGCAAGACTTTCAAACTAGATGAAAAAGACCT 

CAAGAAAAATGTTCGCCTTACnTTTGATGGCGTCTACATGGATTCTCAAGTTTATGTCAAT^ 

25 CATTATCCAAATGGTTATAACCAGTTCTCATATGATATCACCAAATACCTTCAAAAAGATGGTCGTGAGAATGTGA 
TTGCTGTCCATGCAGTCAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAGTGGTATCTATCGTGATGTGACTTT 
ACAAGTGACAGATAAGGTGCATGTTGAGAAAAATGGGACAACTATTTTAACACCAAAACTTGAAGAACAA 
TGGCAAGGTTGAAACrCATGTGACCAGCAAAATCGTCAATACGGACGACAAAGACCATGAACTTGTAGCCGAATA 
TCAAATCGTTGAACGAGGTGGTCATGCTGTAACAGGCTTAGTTCGTACAGCGAGTCGTACCTTAAAAGCACATGA 

30 ATCAACAAGCCTAGATGCGATTTTAGAAGTTGAAAGACCAAAACTCTGGACTGTTTTAAATG 

TACGAATTGATTACGCGTGTTTACCGTGACGGTCAATTGGTTGATGCTAAGAAGGATTTGTTTGGTTACCGT^ 
ATCACTGGACTCCAAATGAAGGTTTCTCTTTGAATGGTGAACGTATTAAATTCCATGGAGTATCC^ 
CCATGGGGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCGCCGTCTCAAACAAATGAAGGAGATGGGAG 
TTAACTCCATCCGTACAACCCACAACCCTGCTAGTGAGCAAACCTTGCAAATCGCAGCAGAACTAGGTTTACTCG 

35 TCAGGAAGAGGCCnTTGATACGTGGTATGGTGGCAAGAAACCTTATGACTATGGACGTTTCm 

CACTCACCCAGAAGCTCGAAAAGGTGAAAAATGGTCTGATTTTGACCTACGTACCATGGTCGAAAGAGGCAAAAA 
CAACCCTGCTATCnTCATGTGGTCAATTGGTAATGAAATAGGTGAAGCTAATGGTGATGCCCACTCTTTAGCAACT 
GTTAAACGTTTGGTTAAGGTTATCAAGGATGTTGATAAGACTCGCTATGTTACCATGGGAGCAGATAAATTCCGT^ 
TCGGTAATGGTAGCGGAGGGCATGAGAAAATTGCTGATGAACTCGATGCTGTTGGATTTAACrATTCTGAAGATA 

40 ATTACAAAGCCCTTAGAGCTAAGCATCCAAAATGGTTGATTTATGGATCAGAAACATCTTCAGCTA 

TGGAAGTTACTATCGCCCTGAACGTGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGATTA 
TGGAAATGATCGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATGGACTTTTGACCGTGACAACGCTGGCTATGC 
TGGACAGTTTATCTGGACAGGTACGGACTATATTGGTGAACCTACACCATGGCACAACCAAAATCAAACrCCTGTT 
AAGAGCTCTTACnTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTATCTCTACCAAAGCCAATGGG 

45 rrrCTGTTAAGAAGAAACCGATGGTACACCITCTTCCTCACTGGAACTGGG^^ 

TAGCTGACTCAGAAGGTAAGATTCCAGTTCGTGCnTATTCGAATGCnTCTAGTGTAGAATTGTTC^ 
ATCTCTTGGTCTTAAGACTTTCAATAAAAAACAAACCAGCGATGGGCGGACTTACCAAGAAGGTGCA 
TGAACTTTATCITGAATGGAAAGTTGCCTATCAACCAGGTACCTTGGAAGCAATTGCTCGTGATG^ 
GAAATTGCTCGAGATAAGATTACGACTGCTGGTAAGCCAGCGGCAGTTCGTCnTATTAAGGAAGACCATGCGAT^ 

50 GCAGCAGATGGAAAAGACITGACrTACATCTACTATGAAATTGTTGACAGCCAGGGGAATGT^ 

AATAATCTGGTTCGCTTCCAATTGCATGGCCAAGGTCAACTGGTCGGTGTAGATAACGGAGAACAAGCCAGCCGT 
GAACGCTATAAGGCGCAAGCAGATGGTTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTG 
CAACTGAACAAGCAGGGAAATTCACCCTGACrGCCCACTCTGATCTCTTGAAATCGAACCAAGTCACTGT(^^ 
TGGTAAGAAAGAAGGACAAGAGAAGACTGTTTTGGGGACAGAAGTGCCAAAAGTACAGACCATTATTGGAGAGG 

55 CACCTGAAATGCCTACCACTGTTCCGTTTGTATACAGTGATGGTAGCCGTGCAGAACGTCCTGTAACCTGGTCTTC 
AGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTAGAAGCTCGTGTAGA 
AGTGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGTATTGCrCCAAATACTGACTTGAATTCTGTAGACA^ 
TCTGTTTCCTATGTTTTGATTGATGGAAGTGTTGAAGAGTATGAAGTGGACAAGTGGGAGATTGCCGAAGAAGATA 
AAGCT AAGTTAGCAATTCCAGGTTCrCGTATTCAAGCGACCGGTTATTTAGAAGGTCAACCAATTCATGCAACCCT 

60 TGTGGTAGAAGAAGGCAATCCTGCGGCACCTGCAGTACCAACTGTAACGGTTGGTGGTGAGGCAGTAACAGGTCT 
TACTAGTCAAAAACCAATGCAATACCGCACTCTTGCITATGGAGCTAAGTTGCCAGAAGTCACAGCAAGTGCT^ 
AAATGCAGCTGTTACAGTTCTTCAAGCAAGCGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGA 
TGGTGGCCCTCTTCAAACCTATGCAATTCAATTCCTTGAAGAAGCGCCAAAAATTGCTCACTTGAGCTTGC^ 
GAAAAAGCTGACAGTCTCAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAA 
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GCTGTATTACCAGCTGATAAAGTAACCTTCTCTACAAGTGGTGAAGGGGAAGTCGCAATTCGTAAAGGAATGCrT 
GAGTTGCATAAGCCAGGAGCAGTCACTCrGAACGCTGAATATGAGGGAGCTAAAGACCAAGTTGAACTCACTATC 
CAAGCCAATACTGAGAAGAAGATTGCGCAATCCATCCGTCCTGTAAATGTAGTGACAGATTTGCATCAGGAACCA 
AGTCnrCCAGCAACAGTAACAGTTGAGTATGACAAAGGTTTCCCTAAAACTCATAAAGTCACTTGGCAAGCTAT^ 
5 CGAAAGAAAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAAAGTTGAAGGAATTGACCTTGAAGCGCGT^ 
CAAAAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGACAACTCCAATCGCAGAAGCACCACAAT 
TACCAGAAAGTGTTCGGACATATGATTCAAATGGTCACGTTTCATCAGCTAAGGTTGCATGGGATGCGATTCGTCC 
AGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGGTCGCTTAGAAGGTACGCAATTAACAACTAAACT^ 
TGTTCGCGTATCTGCTCAAACTGAGCAAGGTGCAAACATTTCTGACCAATGGACCGGTTCAGAATTGCCACr^ 

10 TTTGCTTCAGACTCAAATCCAAGCGACCCAGTTTCAAATGTTAATGACAAGCTCATTTCCTACAATAACC^ 

CCAATCGTTGGACAAACTGGAATCGTACTAATCCAGAAGCTTCAGTCGGTGTTCrGTTTGGAGATTCAGGTATC^ 
GAGCAAACGCTCCGTTGATAATCTAAGTGTCGGATTCCATGAAGACCATGGAGTTGGTGTACCGAAGTCTTATGTG 
ATTGAGTATTATGTTGGTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGGACCA 
TTAATGATTCTGCCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTGCTCAACTCAAGGCTGG^ 

15 ACTTTAGCTTTGATAAAGTTGAAACCTATGCTGTTCGTATTCGCATGGTTAAAGCA 

TATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAGCAAGGACAAACAAGAATCCAAGT^^ 
CAAAGACnTAGCAAACnTCAACCCTGATTTGACAGACTACTACCTTGAGTCTGTAGATGG^ 
ACAGCAAGTGTTAGCAACAATGGTCTCGCTACCGTCGTTCCAAGCGTTCGTGAAGGTGAGCCAGT TCGTG TCATCG 
CGAAAGCTGAAAATCGCGACATCTTAGGAGAATACCGTCTGCACTTCACTAAGGATAAGAGCT^ 

20 AACCAGTTGCTGCGGTTAAACAAGCTCGCTTGCTACAAGTAGGTCAAGCACTTGAATTGCCGACT 

TTACTTCACAGGTAAAGACGGCTACGAAACAAAAGACCTGACAGTTGAATGGGAAGAAGTTCCAGCGGAAAATCT 
GACAAAAGCAGGTCAATTTACrGTTCGAGGCCGTGTCCTTGGTAGTAACCTTGTTGCTGAGATCACTGTAC 
ACAGACAAACTTGGTGAGACTCnTTCAGATAACCCTAACTATGATGAAAACAGTAACCAGGCCTTTG 
ACCAATGATATTGACAAAAACTCTCATGACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATCGTCGTT 

25 GGACAAACTGGTCACCAACACCATCTTCTAATCCAGAAGTATCAGCGGGTGTGATTTTCCGTGAAAATGGTAAGA 
TTGTAGAACGGACTGTTACACAAGGAAAAGTTCAGTTCTTTGCAGATAGTGGTACGGATGCACCATCTAAACTCGT 
TTTAGAACGCTATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAACTACCAAGCCTACGACGCAGACCAT 
CCATTCAACAATCCAGAAAATTGGGAAGCTGTTCCnTATCGTGCGGATAAAGACATTGCAGCTGGTGATGAAATC 
AACGTAACATTTAAAGCTATCAAAGCCAAAGCTATGAGATGGCGTATGGAGCGTAAAGCAGATAAGAGCGGTGTT 

30 GCGATGATTGAGATGACCTTCCITGCACCAAGTGAATTGCCTCAAGAAAGCACTCAATCAAAGATTCT^ 

GAAAAGAACITGCTGATTTCGCTGAAAATCGTCAAGACTATCAAATTACCTATAAAGGTCAACGGCCA/^ 
CAGTTGAAGAAAACAATCAAGTAGCTTCAACTGTGGTAGATAGTGGAGAAGATAGCTTTCCAGTACTTGT^ 
CGTTTCAGAAAGTGGAAAACAAGTCAAGGAATACCGTATCCACTTGACTAAGGAA 
AAACCAGTTTCTGAGAAGACAGTTGCTGCTGTACAAGAAGATCITCCAAAAATCGAATTTGT^ 

35 GCATACAAGACAGTTGAGAAAAAAGATTCAACACTGTATCTAGGTGAAACTCGTGTAGAACAAGAAGGAAAAGTT 
GGAAAAGAACGTATCTTTACAGCGATTAATCCTGATGGAAGTAAGGAAGAAAAACTCCGTGAAGTGGTAG/^ 
CCGACAGACCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAAGAAGCTAAAAAACCACAAGTGTCAGAAAAA 
GCAGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAACTAATAAAGCCCAGTTACCAAGTACAGGTAGTGCG 
GCAAGCCAAGCAGCAGTAGCAGCAGGTTTAACTCITCTAGGTTTGAGTGCAGGATTAGTAGTTACTAAAGGTAAA 

40 AAAGAAGACTAG 



4101.5 

ATGGATGCAATCITTGACCTAATCGGAAAGGTTTTCAATCCCATCTTAGAAATGGGTGGACCTGTCATCA 

45 TCATTTTGACAGTATTGGCTTTACTTTTTGGAGTGAAATTCTCCAAAGCGC^ 

CGCTCTTACAGGTATCGGTGCTATCATCGGTATGCTAAACACTGCTTTCrCAGCATCACTAGCA^ 
AACACTGGTATCCAATTGAGTATTACCGACGTTGGTTGGGCACCACTTGCTACAATCACnTGGGGTTCTGCT^ 
CACTATACITCTTGCTCATCATGTTGATTGTCAACATAGTGATGCTAGCrATGAAGAAAACAGATACA 
CGATATCTTTGATATCTGGCACTTGTCTATCACAGGTCTCTTGATTAAATGGTATGCTGAT^ 

50 CAAGGGGTTTCACrcriTATTGCTACAGCAGCTATCGTCCrTGTCGGTGTGT^ 

GAAACCTACATTTGATGACCnrCITAACGCCCCAAGTTCATCACCAATGACATCAACrrCACATGi ^ 

AACCCAGTTATCATGGTTTTGGATAAGATTTTTGAAAAATTCTTCCCAGGC<nTGA 

CTAAATTGAACAAGAAAATCGGTTTCTGGGGATCTAAATTCITCATCGGTTTCATCCn^ 

ATGGGAACrCCACATCCAATTGCAGGTGTTGCAGATGCAGATAAATGGCGTCTTGTTATCAAAGGATGGTTGTCTC 

55 TTGGTTTGACTGCCGGTGTATCITTGGAACTCTTCTCACITATCGGTTCATGGT^ 

TCACAAGGTATTACAAACGTTGCTACTAAACGTCTTCAAGGACGTAAATTCAATATCGGTCITGACTGGCCATTCA 
TCGCTGGTCGTGCTGAAATCTGGGCTTGTGCCAACGTACTTGCACCAATCATGTTGATTGAAGCAGTGCTTCTT^ 
AAAAGTTGGAAATGGTATCITGCCACrTGCAGGTATCATCGCTATGGGTGTTACTCCAGCTCrCTTGG 
CGTGGTAAATTGCTCCGTATGATTATCTTCGGAACACTCTTGTTGCCACTCrrCCTTCTTTCAGGTA 

60 ACCATTTGCAACAGAACTTGCTAAAGGTGTAGGTGCCTTCCCAGAAGGTGTGAGCCAAACTCAATTGATTACTCAC 
TCTACTCrTGAAGGACCAATCGAAAAACrTCTTGGTTGGACAATTGGTAACACTACAACTGGTGATATCA^ 
TCCTTGGTGCAGTAGTCTTCCITGTATTCTATATCGGTATCITTGCTTGGTACAGAAAACAAATGATC 
CGAAGAGTACGCAGCAAAAGCAAAATAA 

65 4102.1 
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ATGAAGATTATGAAAAAAAAATATTGGACTTTAGCGATATTATTCTTTTGTTTGTTC^ 

AGAAATACCTAAAAATCTTGATGGCAATATAACTCACACTCAGACTAGCGAAAGTTTTTCrGAAT 

ACAGGTTGACTATTCTAATAAAAATCAAGAAGAAGTAGACCAAAATAAATTTCGTATTCAAATCGATAAGAC^ 

ATTATTTGTAACAACAGATAAACATTTAGAAAAAAACTGTTGTAAATTGGAACTTGAACCA 

TATTGTTAACTCTGAAAGTAATAATTTACTAGGCGAAGATAATTTAGATAATAAAATTAAGGAV^ 

CTAGATAATAGAGGAGGAAATATAGAGCATGACAAAGATAACTTAGAATCGTCGATTGTAAGAAAATATGAATGG 

GATATAGATAAAGTTACTGGTGGAGGCGAAAGTTATAAATTATATTCTAAAAGTAATTCTAAAGTT^ 

TTTTAGATTCAGGAGTCGATTTACAAAATACTGGATTACTGAAAAATCTTTCAAATC^ 

CAATAAAGGATATTTAGGAAAAGAGGAGGGAGAGGAAGGAATAATATCAGATATTCAAGATAGATTAGGTCATG 

GTACGGCTGTTGTAGCTCAAATTGTAGGGGATGACAATATTAATGGAGTAAATCCTCACGTTAATATTAACGTCTA 

TAGAATATTTGGTAAGTCGTCAGCTAGTCCAGATTGGATTGTAAAAGCAATTTTTGATGCTGTAGATGATGGCAA 

GATATTATCAATCTTAGTACTGGACAATATTTAATGATTGATGGAGAATATGAGGACGGAACAAATGATTT^ 

CATTTTTGAAGTATAAAAAGGCTATTGATTACGCGAATCAAAAAGGAGTAATTATAGTAGCTGCAT^ 

ACTCCCTAAATGTATCAAATCAGTCAGATTTATTGAAACTTATTAGTTCACGCAAAAAAGTAAGA^ 

AGTAGTTGATGTTCCAAGTTATTTCTCATCTACAATTTCGGTCGGAGGCATAGATCGCTTAG 

GTAATTTATCAGATTTTAGCAATAAAGGGGATTCTGATGCAATATATGCGCCTGCAGGCTCAACATTATCT^^ 

AGAATTAGGACrrAATAACITTATTAATGCAGAAAAATATAAAGAAGATTGGATT^^ 

ATATACGTATCTTTATGGAAACrCATTTGCTGCTCCTAAAGTTTCTGGTGCG 

AATTAAAAGATCAGCCCTATAATTATATGTTTGTAAAAAAATTCTGGAAGAAACATTACCA 

4106.1 

ATGAAGAAAACATGGAAAGTGTTTTTAACGCTTGTAACAGCTCITGTAGCTGTTGTGC^ 

GAACTGCITCTAAAGACAACAAAGAGGCAGAACTTAAGAAGGTTGACTTTATCCTAGACTGGACACC/^ 

ACCACACAGGGCTTTATGTTGCCAAGGAAAAAGGTTATTTCAAAGAAGCTGGAGTGGATGTTGATTTGA^ 

CACCAGAAGAAAGTTCTTCTGACTTGGTTATCAACGGAAAGGCACCATTTGCAGTGTATTTCCA^ 

TAAGAAATTGGAAAAAGGAGCAGGAATCACTGCCGTTGCAGCTATTGTTGAACACAATACATCAGGAATCATCTC 

TCGTAAATCTGATAATGTAAGCAGTCCAAAAGACITGGTTGGTAAGAAATATGGGACATGGAATGACCCAACTGA 

ACnTGCTATGTTGAAAACCTTGGTAGAATCTCAAGGTGGAGACTTTGAGAAGGTTGAAA^ 

CTCAAACTCAATCACACCGATTGCCAATGGCGTCTTTGATACTGCm'GGATTTACTACGGTTGGG^^ 

GCTAAATCTCAAGGTGTAGATGCTAACITCATGTACITGAAAGACTATGTCAAGGAGTTTGACTA 

TTATCATCGCAAACAACGACTATCTGAAAGATAACAAAGAAGAAGCrCGCAAAGTCATCCAAGCCATCAAAA^ 

GCTACCAATATGCCATGGAACATCCAGAAGAAGCTGCAGATATTCTCATCAAGAATGCACCTGAACTCAAGGAAA 

AACGTGACITTGTCATCGAATCTCAAAAATACTTGTCAAAAGAATACGCAAGCGACAAGGAAA^ 

TTGACGCAGCTCGCTGGAATGCTTTCTACAAATGGGATAAAGAAAATGGTATCCTTAA^ 

AAGGCTTCACCAACGAATTTGTGAAATAA 

4106,4 

ATGATAAAAAATCCTAAATTATTAACCAAGTCTTTTTTAAGAAGTTTTGCAAT^ 

TTCATATAGCTATTTATTTGACCITTCCTTTTTATTATATTCAACTGGAGGGGGAAAAG^^ 

AGTGTTTACGGAGTATTTAAAGACTAAGACATCTGATGAAATTCCAAGCTTACTCCAGTCTTATTCAAAGT^ 

ACCATATCTGCTCACCTTAAAAGAGATATTGTAGATAAGCGGCTCCCTCTTGTGCATGACITGGAT^ 

GAAAGCTATCAAATTATATCGTGATGTTAGATATGTCTGTTAGTACAGCAGATGGTAAACAGGTAACCGTGCAATT 

TGTTCACGGGGTGGATGTCTACAAAGAAGCAAAGAATATTTTGCTTTTGTATCTCCCATATACATTT^ 

ATTGCITITTCCriTGTTTTTTCTTATTT^ 

AGTAAAATGCAAGATTTGGATGACAATATTCGTTTTGATGAAAGTAGGAAAGATGAAGTTGGTGAAGTTGGAA^ 

CAGATTAATGGTATGTATGAGCACITGTTGAAGGTTATTTATGAGTTGGAAAGTCGTAATGAGCAAAT^ 

TGCAAAATCAAAAGGTTTCCTTTGTCCGCGGAGCATCACATGAGTTGAAAACCCCITTAGC^ 

CCTAGAGAATATGCAGCATAATATTGGAGATTACAAAGATCATCCAAAATATATTGCAAAGAGTATAAATAAGAT 

TGACCAGATGAGCCACITATTAGAAGAAGTACTGGAGTCTTCTAAATTCCAAGAGTGGACAGAGTGTCGTGAGAC 

CTTGACTGTTAAGCCAGTTTTAGTAGATATTTTATCACGTTATCAAGAATTAGCTCATT 

AAAATCAATTGACAGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAGGTTTTGACAAACCTGAT^ 

GTAATGCAATTAAATATTCAGATAAAAATGGGCGTGTAATCATATCCGAGCAAGATGGCTATCTCTCTATCAAAA 

ATACATGTGCGCCTCTAAGTGACCAAGAACTAGAACATTTATTTGATATATTCTATCATTCTCAAATCGTC 

TAAGGATGAAAGTTCCGGTTTGGGTCITTACATTGTGAATAATATTTTAGAAAGCrAT^ 

CTCCCTTATGAACACGGTATGGAATTTAAGATTAGCTTGTAG 

4106.6 

ATGTATTTAGGAGATTTGATGGAGAAAGCCGAGTGTGGTCAATTTTCAATACTTTCCTTTCTATTACAAGAGTCTC 

AGACGACCGTCAAGGCTGTAATGGAAGAAACAGGATTTTCAAAAGCAACCCTAACCAAATATGTCACCCTGCTCA 

ATGACAAGGCTTTGGATAGTGGCTTAGAGCTGGCTATTCACTCAGAAGATGAAAATCTGCGTCTGTCTATCGGTGC 

AGCTACCAAGGGGAGAGATATTCGGAGCTTGTTTTTGGAGAGTGCrGTTAAATACCAGATTTTGGT^ 

TACCACCAACAGTTTTTAGCCCATCAGCTGGCTCAAGAATTGGTGATTAGCGAGGCTACGCTTGGTCGTCACTTGG 

CTGGTTTAAATCAGATTTTGTCAGAATTTGATTTATCCATCCAAAATGGCCGTTGGCGAGGTCCAGAGCATCAGA 

TCACTATTTCTATTTCTGTCTTTTCCGAAAGGTCTGGTCGAGTCAGGAATGGGAAGGTCACATGCAGAAACCAGAG 

AGAAAACAGGAGATTGCCAATTTAGAGGAAATCTGCGGTGCAAGTTTGTCTGCGGGGCAGAAATTGGACITGGT^ 
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CTCTGGGCTCACATCAGTCAACAACGTCTTCGGGTCAATGCITGTCAGTTTCAAGTCATAGAAGAGAAAATGCGA 

GGGTATTTTGACAATATCTTTTATCTTCGTTTGCTGAGAAAGGTTCCGTCCr^^ 
AGGAGTTGAGGATGGTGAGATGATGATATTCTTCTCITTTCrCCTATCTCATCGCATTCTTCCTC^ 
AGTATATTCTTGGTTTTGGAGGGCAGTTGGCAGATTTACTGACGCAATTGATTCAAGAAATGAAGAAG 
5 TATTGGGGGATTATACAGAGGACCATGTCACCrATGAACTCAGTCAGCTTTGTGCTCAAGTCTATCTCTATAAGGG 
CTATATTTTACAGGATCGCTACAAGTACCAGTTAGAGAATCGTCATCCATATTTACTGATGGAACATGATTTTA^ 
GAGACAGCAGAGGAGATTTTTCATGCTCTACCTGCITTTCAACAGGGGACAGATTTAGATAAGAA^ 
AATGGCrCCAGTTAATCGAATATATGGCTGAAAACGGTGGCCAGCATATGCGGATTGGTCTGGATTTGACATCTGG 

TTTTCTTGTCTTTTCAAGGATGGCAGCCATTTTGAAACGGTATTTGGAATAC^ 
10 ATGACCCTAGTCGGCATTATGATTTGCTGGTTACCAATAACCCGATTCATAAGAAGGAACAGACACCAGTCTATTA 

TTTAAAAAATGACTTGGATATGGAGGATTTGGTAGCGATTCGCCAGTTATTATTCACTTAA 
4106.7 

ATGGAATTTTCAAAGAAAACACGTGAATTGTCAATTAAAAAAATGCAGGAACGTACCCrGGACCr(^ 
15 GGTGGAGGAATCACAGGAGCrGGTGTAGCCTTGCAGGCGGCAGCTAGCGGTCTTGAGACTGGTTTGATTGAAATG 
CAAGACTTTGCAGAAGGAACATCTAGTCGTTCAACAAAATTGGTTCACGGAGGACTTCGTTACCT^ 
ACGTAGAAGTGGTCTCAGATACGGTTTCTGAACGTGCAGTGGTTCAACAAATCGCTCCACACATTCCAAA^ 
ATCCAATGCTCTTACCAGTTTACGATGAAGATGGAGCAACCTTTAGCCTCITCCGTC^ 

GTACGACCTCnTGGCAGGTGTTAGCAACACACCAGCTGCGAACAAGGTTTTGAGCAAGGATCAAGTCTTGGi^ 
20 CCAGCCAAACTTGAAGAAGGAAGGCTTGGTAGGAGGTGGAGTGTATCTTGACTTCCGTAACAACGATC 

CGTGATTGAAAACATCAAACGTGCCAACCAAGACGGTGCCCTCATTGCCAACCACGTGAAGGCAGAAGGCTTCCT 
CTTTGACGAAAGTGGCAAGATTACAGGTGTTGTAGCTCGTGATCTCTTGACAGACCAAGTGTTTGA^ 
CGTCTGGTTATTAATACAACAGGTCCTTGGAGTGATAAAGTACGTAATTTGTCTAATAAGGGAACGCAATTCTCAC 
AAATGCGCCCAACTAAGGGAGTTCACTTGGTAGTAGATTCAAGCAAAATCAAGGTTTCACAGCCAGTTTAC^ 
25 ACACAGGTTTGGGTGACGGTCGTATGGTCTTTGTTCTCCCACGTGAAAACAAGACITACmTGGTACAACT 

AGACTACACAGGTGATTTGGAGCATCCAAAAGTAACTCAAGAAGATGTAGATTATCTACTTGGCATTGTCAACAA 
CCGCTTCCCAGAATCCAACATCACCATTGATGATATCGAAAGCAGCTGGGCAGGTCTTCG 

TCCATTGATTGCAGGGAACAGTGCCTCTGACTATAATGGTGGAAATAACGGTACCATCAGTGATGAAAGCTTTGA 
CAACTTGATTGCGACTGTTGAATCTTATCrCTCCAAAGAAAAAACACGTGAAGATGTTGAGTCTGCT 
30 CITGAAAGTAGCACATCrGAGAAACATTTGGATCCATCrGCAGTTTCTCGTGGGTCTAGC^ 

ATGGTCTCTTGACTCTTGCTGGTGGTAAAATCACAGACTACCGTAAGATGGCTGAAGGAGCTATGGAGCGCGTGG 
TTGACATCCTCAAAGCAGAATTTGACCGTAGCTTTAAATTGATCAATTCTAAAACI^ 

ATTGAACCCAGCAAATGTGGATTCAGAAATCGAAGCCmTGCGCAACTTGGAGTATCACGTGGTTTGGATAGCA^ 
GGAAGCTCACTATCTGGCAAATCTTTACGGTTCAAATGCACCGAAAGTCTTTGCACTTGCTCACAGC^ 
35 GCGCCAGGACrCAGCTTGGCAGATACTTTGTCCCITCACTATGCAATGCGCAATGAGTTGACTC^ 
ACnrCCTTCTTCGTCGTACCAATCACATGCrCITTATGCGTGATAGCTTGGATAGTATCG^ 

GAAATGGGACGATTCTATGACTGGACAGAAGAAGAAAAAGCAACTTACCGTGCTGATGTCGAAGCAGCTCTCGCT 
AACAACGATTTAGCAGAATTAAAAAATTAA 

40 4106.8 

ATGATGAATGAATTATTTGGAGAATTTCTAGGGACTTTAATCCTGATTCTTCTAGGAAATGGTG^ 
TGGTTCTTCCTAAAACCAAGAGCAATAGCTCAGGTTGGATTGTGATTACTATGGGTTGGGGGATTGCAGTTGC^ 
TGCAGTCTTTGTATCTGGCAAGCTCAGTCCAGCTTATTTAAACCCAGCTGTGACCATCGGTGTGGCC n^ 
GGTTTGCCTTGGGCTTCCGTTTTGCCITATATCTTAGCCCAGTTCGCAGGGGCCATGCTGGGTC^ 
45 GTTGCAATTCAAACCTCACTATGAGGCAGAAGAAAATGCAGGCAATATCCTGGCAACCTTCAGTACTGG ACCAG 
CATCAAGGATACTGTATCAAACTTGATTAGCGAAATCCTTGGAACTTTTGTTTTG^^ 
GTCnTTACGACTTTCAGGCAGGTATCGGAACCTTTGCAGTGGGAACTTTGATTGTCGGT^ 

TGGGACAACAGGTTATGCCTTGAACCCAGCTCGTGACCTTGGACCTCGTATCATGCACAGCATCTTGCCAATTCC^ 
AACAAGGGAGACGGAGACTGGTCTTACGCTTGGATTCCTGTTGTAGGCCCTGTTATCGGAGCAGCCTTGGC 
50 CITGTATTCTCACTTTTCTAG 

4106.10 

ATGAAAAGGACCTGGAGGAACTCATTCGTGACAAATCTTAATACACCmTTATGATTGGCAATATTGAGATTCCCA 
ATCGTACCGTTTTAGCGCCTATGGCTGGCGTGACCAACTCAGCCTTTCGTACTATCGCAAAGGAGCTCGGAGCT 

55 ACTCGTTGTAATGGAAATGGTCrCTGACAAGGGAATCCAATACAACAACGAAAAAACCCTGCACATGCTTCATAT 
CGATGAGGGCGAAAACCCTGTCTCTATCCAA(mTTTGGTAGCGATGAAGACAGCCTAGCACGCGCAGCAGAATT 
CATCCAAGAAAACACCAAGACCGATATCGTCGATATCAACATGGGCTGCCCTGTCAACAAAATCGTGAAGAACGA 
AGCTGGTGCTATGTGGCTCAAGGATCCAGACAAGATTTACrCCATCATCAACAAGGTCCAGTCTGTCCTTGATATC 
CCACTTACTGTCAAAATGCGTACCGGCTGGGCGGACCCATCTCnTGCAGTAGAAAATGCTCTCGCTGCTGAAGCTG 

60 CAGGTGTTTCTGCCCTCGCCATGCATGGCCGTACCCGTGAACAAATGTATACTGGCCACGCAGACCTTGAGACCCT 
TTACAAGGTTGCCCAAGCTCTAACCAAGATTCCATTCATCGCCAACGGTGATATCCGTACTGTCCAAGAAGCCAA 
GCAACGCATCGAAGAAGTTGGTGCTGACGCAGTCATGATTGGCCGAGCTGCCATGGGAAATCCTTACCTCTTCAA 
CCAAATCAACCATTACTTTGAAACAGGAGAAATCCTACCTGATTTGACCTTTGAAGACAAGATGAAGATCGCCT 
CGAACACTTGAAACGATTGATTAACCTCAAAGGAGAAAACGTCGCAGTTCGTGAATTCCGCGGTCTCGCTCCTCA 
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CTATCTCCGTGGAACATCTGGCGCTGCCAAACrCCGTGGAGCCATTTCGCAAGCCAGCACCCTGGCAGAGATTGA 
AACCCTCTTGCAATTGGAGAAGGCTTAA 



4107.1 

ATGACAAAGAAGAAAATTGAGCGTATTTCTGTAATACACCGAGAAAAGATTTTATGGCTCAAGTGGTAT^ 

GAGATAAAGAACAACCTAAGTATAGTGTCCTTGAGCGTAAAATGTTTGATGCTGCTAAAAATCAAG ATATG CTAG 

CTTATCAAAAATACGCAACTATCAAGCAGATAACAGATATTAGGGTACAAACAAGTGAGGCTGACATTTTAGAGG 

CTGTAAAAGAGGTTTATGTGTACAATCACATGAATGTTATCGGAGCITGTCAGCGGATATTATTTATCAGTCAATC 

ACCAGCITATGATAAGTTAAATAAGTGGTTTAATATCrATTCTGATTTGTATTTT^^ 

TGGGGGTATATCATGAGATGGTAGGTATCTAG 

4107,2 

ATGAAAAATTCCAACGAGGCTGAGATGAAATTACTTTATACTGATATTCGGACTTCTTT GA 

GAGAGGCAGAAGAGCTAGTTGCAGCTGGCAAGCGGGTCTTCTACATTGCCCCCAACrcra 

ACGCGCCGTGCTGGAATACITGTCCCAGCAGGCTTCITTTTCGATTACCGTCACGCGC^^ 

TATCTGGTCITGAATGATTTACCAGCTAAAACTACTCnTGATGATATCGGTCTTGGGT^ 

TGCCGAACTCGATCCCAAGGACTTGCGTGTTTATGGCGCTATTAAGCAGGATCCTCAATTGATCCAG 

GAGCITTACCATGAGATGACCAAATCTCAGATGAGTTTTTTGGACTTGGAG 
GCGGATTTACTCITGATTTTTGAGAAAGTAACAGCCTATCTTAATCAAGGTCAGT^ 

CCCATTTGATTGAGGCTATTGAGAATGACAAGGTAAGTAGTGATTTTAATCAAATCGCCTTGGTCATTGACGGC^ 

TACTCGTTTTTCTGCTGAGGAAGAGCGGGTTGTGGACTTACTTCACGGCAAAGGTGTTGAGATTGTTA 

TATGCTAGTAAGAAAGCCTATACCAGTCCTTTTAGCGAGGGCAATCTCTACCAAGCCAGCGTAAAATTTCT 

ATCTGGCTTCrAAATACCAAACGCCTGCTCAGGACTGTTCrCAAACTCATGAGAAGATGGATAGTTTTG 

CTCTCGTTTGTTGGAGTCITCTTATGACTTTTCAGAACTCGCTTTGGATC 

CAAATCTGGTCTTGTTTGACGCAAAAGGAGGAGTTGGAGCTAGTAGCCCGTAGTATTCGTCAGAAATTACATGAG 
AACTCAGACCTGAGCTACAAGCATTTTCGTATTCrCITGGGGGATGTAGCTTC^ 
TTTTTGACCAGTATCAGATTCCTTTTTATCTTGGTAGAAGCGAAGCCATGGCTCATCATCCCTTGACT 
GAGTCTATTTTAGCTTTAAAACGTTACCGTTTTCGTCAGGAGGATTTGATTAATC^ 

TGACCTCAGTCAGTCTGATATTGATGCTTTTGAGCAATATATCCGCTATCTTGGTATCAATGGCTTGCCAG^ 

AGCAAACCITCACCAAATCCCACCATGGAAAATTTAATCTTGAGCGTTTGAATGTCCTCCGCCTC 

ACCTCITGAAACCCTCrTTGCCAGCCGAAAACAAAAGGCTGAAAAACTCCTACAA^^ 

AGAAGGAGCTGTGACCAAGCAGTTACAAGATTTGACAACCACTTTGGAAGCTGTAGAACAGGAAAGACAAGCCG 
AAGTTTGGAAGGCTTTCTGCCATGTTTTAGAACAATTTGCGACTGTTTT^ 

CTTCCTAGCCTTGCTCCATTCTGGAATGAGTTTGTCCCAATACCGTACCATTCCAGCAACAGTGGACACTGTTCT 

GTGCAGAGTTACGATTTGATTGCACCATTGACTGCTGACTTTGTCTATGCTATTGGACTAACTCAGGACAAT^ 

CAAAAATTTCTCAAAACACCAGTCTTCTGACAGATGAAGAAAGGCAAAACCTAAACCAAGCGACCGAAGAAGGC 

GTTCAATTACTGATTGCCAGCAGTGAAAATCTCAAGAAAAATCGCTACACTATGCnTTCC^ 

GTAAGCAGTTGTTCTTGTCGGCTCCAAGCCTTTTTAACGAAAGTGAAAGTAAGGAATCTGC 

GATCCATTTTGGATTTAGGCGGAGAGAGAAGAGGATGAATCACAAAGGACTGTCTAAGGAGGATATGGGG TCCTA 

TCACAGTCTTTTGTCTAGTCTGGTTGCCrATCACCAGCAGGGTGAGATGAGCGATACTGAGCAAGATTTGAC^^ 

GTCAAGGTTCTGTCGCGTGTCATAGGTAAAAAACTAGATCAGCAAGGTCTGGAAAATCCAGCTATCCCAACCAGT 

CCAAGCAGCAAGACCTTAGCCAAGGACACCTTGCAAGCTCT 

CTATCCTGCCAAACAGGAGTTTTACCrGTCTACGTCGGGTTTGACAGAGTTTTATCG 

CTACGCTACGTTTTAGGCTTGCAGGAGGAATTACGTTTGCATCCTGATGCCCGTAGTCACGGGAAT^ 

GTATCTTTGAACGCGCCITACAGTTGCCrAATGAAGATTCCITTGACCAACGT 

CAGTCAAGAACGCGAATTTGAAGCTATTTATCAAGAAAGTTTGGAAGCCCAGTTTACCAAGGAAGTT^ 

GTTGCACGGACAACTGGACATATTCTCCGACACAATCCAGCCATCGAAACCATCAAAGAAGAAGCAAATTTTGGT 

GGAAAAGACCAAGCCTTTATTCAATTAGACAATGGACGCAGTGTCTTTGTACGAGGCAAGGTGGACCGGATTGAC 

CGTTTGAAAGCTAATGGAGCGATAGGAGTAGTAGACTACAAATCCAGTCTGACrCAGTTCCAGTTTCCTCATTTCT 

TTAATGGGCrCAATTCTCAGTTACCAACCrATCTTGCTGCCCTAAAAAGAGAAGGGGAGCAGAACT^^ 

CATGTACTTGGAAATGGCTGAACCTGTCCAATCTCTGATGGCGGTAAAAAGTCTGGCAGGAGCAGTGGTAGAAGC 

CAGCAAATCTATGAAATACCAAGGGCTCTTCTTGGAAAAAGAAAGCAGTTATTTAGGCGAATTT^ 

CAAGGCTAATCAACTGACAGATGAGGAATTTCAGCTCCTACTGGACTACAATGCCTATCITTACAAGAAAGCTGCT 

GAGAAGATTTTAGCAGGCCGGTTCGCCATCAATCCrTATACTGAAAATGGCAGAAGCATTGCCCCATACG^ 

CAACATCAGGCTATTACAGGCTTTGAAGCCAATTACCATCTGGGCCAAGCCCGTTTCCTAGAAAAGTTG^^ 

CTGATGGCAAGCGTCTGGTCGGAGAAAAACTCAAGCAAGCTTGGCTTGAAAAAATAA 

GAGAGGAGTTGAATCGATGA 

4107.3 

ATGAAGCTTATTCCCTTTTTAAGTGAGGAGGAGATTCAAAAACTGCAAGAAGCAGAAGCAAATTCGAGCAAGGAA 

CAGAAGAAAACTGCCGAGCAAATCGAAGCTATCTACACTTCTGCCCAGAATATCCTGGTCTCAGCATCGGCTGGT 

TCTGGAAAGACCITTGTCATGGCAGAGCGCATTCTGGACCAATTGGCGCGTGGTGTCGAAATTTCTCAACrCT^ 
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TCTCAACCTTTACCGTCAAGGCTGCAACTGAACTTAAAGAACGTTTAGAGAAAAAAATCAGCAAGAAA^ 
AAACAGATGATGTCGACCTCAAACAACACTTGGGTCGCCAGTTGGCAGACCTACCCAACGCTGCCATTGGAACCA 
TGGATTCTTTCACACAAAAATTCCTTGGCAAACATGGTTATCTGCTTGATATTGCACCTAATTTC 
AACCAAAGCGAGCAACTTATTCTCGAAAACGAAGT(mTCATGAGGTCrnn^ 
5 AAAGAGACCTTTAGTCATTTGCTGAAAAACTTTGCTGGGCGTGGCAAGGACGAACGGGGTCT 

TATAAAATCTATGACTTCCTCCAATCCACCAGTAATCCTCAAAAGTGGCTGAGTGAATCTTTCCTCAAAGGAT^ 
AGAAAGCTGATTTTACCAGTGAAAAAGAAAAACTGACCGAGCAAATCAAACAAGCCCTTTaGGA^ 
TTTTCCGTTACCATCTGGATAACGATGCCAAGGAGTTTGCAAAGGCTGCCTATTTAGAAAATGT^ 
GGATGAAATTGGCTCCCTAAATCAGGAGTCCGATAGTCAGGCTTATCAGGCAGTGCTTGCGCGTGTTGTCGCCATC 

10 TCTAAGGAGAAAAACGGTCGAGCTCTGACTAATGCCAGCCGTAAGGCTGATTTGAAGCCCCTGGCTGATGCCTA 

AACGAAGAGAGAuAAGACCCAGTTTGCTAAACTAGGACAATTATCAGACCAGATAGCGAT 

TCTCGACTATCAAGAACGTTATCATGGAGACACTTGGAAACTAGCTAAAACCTTCCAATCT^ 

GTAGAGGCTTATCGTCAGAGAAAACGACAGGAAAATGCCTTCGAATTCGCTGATATCAGCCATTACACCATTGAG 

ATTTTAGAGAATTTCCCACAAGTTCGTGAGTCTTATCAGGAGCGCITCCATGAAGTCATGGTCGATGAG 

15 ATACCAACCATATTCAAGAACGGATGCTGGAATTGTTGTCTAATGGCCACAATCGCITTATGGTG 

GCAATCCATCTATCGTTTCCGTCAGGCAGACCCGCAGATTTTCAATGAGAAATTCCAACGCTATGCGC>^^ 
CAAGAAGGCAGGCTCATTATCCTCAAGGAAAATTTCCGTAGTAGTTCAGAAGTGCTGTCAG CAACC AATGATGTC 
TTTGAACGTCTCATGGACCAAGAGGTCGGCGAAATCAACTATGATAACAAGCACCAGCTTGTTT^ 
AACTGACTCCCAATCCAGACAACAAGGCAGCATTTCTCCrCTACGACAAGGACGATACAGGTGAGGAAGAAGAGA 

20 GTCAAACAGAAACGAAACTAACAGGCGAAATGCGCTTAGTTATCAAGGAGATTCTGAAACTTCATC^ 
GTGTTGCCnTTAAGGAAATTGCCCTTCTGACCTCCAGCCGCAGTCGTAATGACCAGATTCTCCTCGCCCT 
GTACGGAATTCCTGTCAAAACTGACGGAGAGCAAAACAATTATCTCCAATCCCTAGAAGTGCAAGTCATG CTAGA 
CACTCTTCGTGTCATTCACAATCCCCTGCAAGACTACGCCTTGGTTGCCCTTATGAAGTCTCCAATGTT^ 
ATGAGGATGAGCTAGCACGTTTGTCCCTTCAGAAAGCAGAGGATAAAGTCCACGAAAATCTCTATGAGAAACTGG 

25 TCAATGCACAAAAAATGGCAAGTAGTCAAAAAGGCTTGATTCACACAGCTCTAGCTG 

AAAAACTAAAGCAATTCATGGATATCCTAGCTTCTTGGCGCTTGTATGCCAAAACCCACTCTCT 
TTGGAAGATTTACAACGACCGTTTTTATTATGACTATGTTGGGGCTTTGCCGAATGGTCCT 
CTCTATGCCCTAGCACTGCGTGCTGATCAATTTGAAAAGAGCAATTTCAAAGGTTTGTCGCGTT^ 
TTGACCAAGTCITAGAAGCCCAGCACGATTTGGCAAGCGTGGCCGTCGCACCGCCAAAAGATGCAGTAGAGCTCA 

30 TGACCATCCACAAGAGTAAAGGGCTGGAGTTTCCTTACGTCTTTATCCTCAATATGGATCAAGATTTC^ 

AGACTCTATGTCAGAAGTCATTCTCAGTCGTCAGAATGGTCTTGGTGTCAAATATATTGCCAAGATGGAGACAGGG 
GCAGTAGAAGACCACTATCCTAAuAACCATCAAACTCTCCATTCCTAGTCrGACCTATAGGCAGAACGAAGAGGAA 
TTACAGCTAGCAAGCTATTCTGAGCAGATGCGTTTGCTGTATGTTGCTATGACGCGGGCTGAGAAAAAGCTCTA^ 
TTGTCGGCAAGGGTTCTCGTGAAAAGCTGGAATCCAAGGAATACCCAGCAGCCAAAAATGGGAAACTAAATAGCA 

35 ATACTAGACTGCAAGCACGGAATTTCCAAGATTGGCTTTGGGCTATCAGTAAAGTGTTTACrAAGGAC^ 

ACTTTAGTTATCGTTTTATTGGCGAAGATCAGTTGACCAGAGAAGCTATCGGAGAGTTGGAAACCAAGAGTCCTCT 
CCAAGATAGCTCCCAAGCAGACAATCGTCAGTCAGATACCATCAAAGAAGCTCTGGAAATGCTGAAGGAGGTGGA 
AGTTTATAATACTCTTCACCGCGCAGCTATTGAACTTCCTAGTGTTCAAACCCCAAGTCAAATCAAG^^ 
GAACCAGTTATGGATATGGAAGGTGTCGAGATTGCTGGTCAAGGTCAGTCAGTAGGCAAGAAAATCAGCTTCGAT 

40 TTGCCAGATTTTTCAACCAAAGAAAAGGTAACTGGAGCTGAGATTGGTAGTGCTACTCACGAACT 

ATTGACCTCAGCCAGCAACTAACCCnTGCTAGCCTAACAGAAACACTCAAACAAGTTCAAACTAGCCA^ 
AGAGACAAGATCAATCHTGATAAAATTCnTGCTTTCTTTGACACAGTACrCGGTCAGGAAA 
ACCATCnTTATCGCGAGCAACCTTTCTCCATGCTCAAACGAGACCAAAAGAGTCAGGAAGACT^ 
TATCCITGATGGCrATCTGCITTACGAAAACAAAATTGTTCTGTTCGACTA^ 

45 AGTCAACTCGTAGACCGCTATCGTGGTCAGTTAGCTCTATACGAAGAGGCTTTATCACGAGCCTATTCGATTGAAA 
ATATTGAAAAATACITGATTTTACTCGGTAAAGACGAGGTTCAAGTTGTAAAAGTATAA 

4109.1 

ATGGAACTTGCTCGCCATGCTGAAAGCrTGGGAGTAGATGCTATTGCAACGATTCCACCAATTTATTTC 
50 CAGAATACTCAGTTGCCAAATACTGGAACGATATCAGTTCTGCAGCTCCAAACACAGACTACGTGATTTACAACA 
TTCCTCAATTGGCAGGGGTTGCnTTGACTCCAAGCCTTTACACAGAAATGTTGAAAAArCCTCGTG^ 
GAAGAACTCTTCTATGCCAGTTCAAGATATCCAAACCTTTGTCAGCCTTGGTGGAGAAGACCATATCGTC^ 
GGTCCTGATGAGCAGTTCCrAGGAGGACGCCTCATGGGGGCTAGGGCTGGTATCGGTGGTACTTATGGTGCTATGC 
CAGAACTCITCITGAAACTCAATCAGTTGATTGCGGATAAGGACCTAGAAACAGCGCGTGAA^ 
55 CAACGCAATCATTGGTAAACrCACTTCTGCTCATGGAAATATGTACGGTGTCATCAAAGAAGTCTTGAA/^ 

GAAGGCITGAATATTGGATCTGTTCGTTCACCATTGACACCAGTGACTGAAGAAGATCGTCCAGTTGTAGAAGCG 
GCTGCTGCCTTGATTCGTGAAACCAAGGAGCGCTTCCTCTAA 

4110.2 

60 ATGTATAAGACAAAGTGTTTACGAGAGAAGTTAGTATTATTTTTAAAAATTTTCTTCCC^ 

TTGCCAATTATTCTGCCTCTTTTGTTGATACTGCAATGACAGGTCAATACAACACTATGGAC^ 

ATGGCAACCAGTATCTGGAATCCTTTCTTTACATTTCTAACAGGGATTGTGTCAGCCITGGTGCCTAT^ 

CCATCTTGGTCGAGGCAAAAAGGAAGAAGTTGCGTCTGATTTTTACCAATTTATTTATT^ 

GTGGTCTTGCTGGGGATGGTACnTTTCnTGGCACCAATAATCTTGAATCATATTGGGTTAGAAGCAGCAGTAGCGG 
65 CAGTAGCGGTTCGCTATCTTTGGTTTTTATCTATCGGGATTATCCCCTTGTTGCTCTTTAGCGTCATTCGTTC 
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TGGATTCGCTGGGCnTGACCAAACTGTCCATGTACCTCATGCTTTTGTTACTCCCTCTCAATA^ 
CTCTTGATTTACGGTGCCTTTGGTGTTCCAGAACTGGGAGGGGCTGGTGCTGGTTTAGGAACATCCTT 

GGGTCTTGCTTGGGATTTCTGTTCTGGTTTTATTTAAACAGGAGAAGCTCAAAGCCT^ 
TCCACTTAATATGGATAAAATTAAGGAAGGAGrrCGTTTAGGTCTGCCTATTGGGGGAACTGTCITCGC 

5 GCTATCTTTTCAGTGGTTGGCITGATTATGGCTAAGTTTTCGCCCTTGATTATAGCT 

(nTTTCAAGTCTTATGTACGCCTTTCCTATGAGTATCTCATCGGCTATGGCTATTGTCGTT^ 

CCAAGCGATTTGATGATGCGAAAACCTATATTGGTCTAGGAAGATGGACTGCCCTCATTTTTGCGGCCTTCACCCT 
AACCTTCCTTTACATTTTTAGGGGAAATGTGGCCAGTCTTTATGGTAACGACCCAA^ 
CGTTTTTTAACTTATAGTCTTTTCTTCCAGTTAGCAGATACCTTTG 
10 TAAGGATACAGTTATTCCTTTTTACCTTGGTTTGCTTGGTTATTGGGGCGTAGCAATCCCTGTGT^ 

4112.2 

ATGAGTACTTTAGCAAAAATAGAAGCGCTCTTGTTTGTAGCGGGTGAAGATGGGATTCGGGTCCGCCAGTTAGCT 
GAACTCCTCTCTCTGCCACCGACAGGCATCCAGCAAAGTTTAGGAAAATTAGCCCAGAAGTATGAAAAGGACCCA 
15 GATTCCAGTTTGGCTTTGATTGAGACAAGTGGTGCITATAGATTGGTGACCAAGCCTCAATTTGCAGAGAT^ 
AGGAATACTCTAAGGCGCCTATCAACCAGAGCTTGTCTCGGGCrGCCCTTGAGACCTTGTCCATTAT^^ 
ACAGCCGATTACGCGGATAGAAATTGATGCCATCCGTGGAGTTAACTCGAGTGGAGCCTTGGCAAAGTTGCAGGC 
TTTTGACCTGATAAAGGAAGACGGGAAAAAGGAAGTATTGGGGCGCCCCAACCTCTATGTGACTACGGATTAT^ 
CCTAGATTACATGGGGATAAACCATTTAGAAGAATTACCAGTGATTGATGAGCrTGAGATTCAAGCCCAAGAA^ 

20 CCAATTATTTGGTGAAAGGATAGAAGAAGATGAGAATCAATAA 
4113.1 

ATGGATACGATGATTAGTAGATTTTTTCGCCATTTATTTGAAGCCTTAAAAAGT^ 
CAGTAGCTGCTGTCAGTTCAGTCATGATTACITTGACCTTGGTGGCAATATTTGCATCTGT^ 
25 AAACTAGCTACAGATATTGAAAATAATGTCCGTGTAGTAGTTTATATCCGAAAGGATGTGGAAGATAATAGTCAG 
ACAATTGAAAAAGAAGGTCAAACTGTTACAAATAATGACTACCACAAGGTATATGATTCTTTGAA 
ACGGTTAAAAGTGTTACCTTTTCAAGTAAAGAAGAACAATATGAAAAATTAACCGAGATA^ 
AAAATCITTGAAGGAGATGCCAATCCTCTCTATGATGCCTATATTGTAGAGGCAAACACTCCAAATGATGTAA 
ACTATAGCCGAAGATGCTAAAAAAATTGAAGGTGTCTCTGAGGTTCAAGATGGCGGTGCCAATACAGAAAGACTC 

30 TTCAAGTTAGCTTCATTTATCCGTGTTTGGGGACTAGGGATTGCTGCTTTGTTAATTT^ 

TTCAAATACCATTCGTATTACCATTATTTCCCGCAGTCGCGAAATTCAAATCATGCGCTTGGTCGGAGCTAAAA^ 
AGTTATATCCGTGGACCGTTCTTGTTAGAAGGAGCCTTTATCGGTTTATTGGGAGCTATCGCACCATCTGTT^ 
CTTTATTGTTTATCAAATTGTTTACCAATCTGTCAACAAATCGTTGGTAGGGCAAAATCTATCCA^^ 
GATTTATTTAGTCCGTTGATGATTGCCCTACTATTTGTGATTGGGGTTTTCATTGGT^ 

35 CATGCGCCGATTCTTGAAGATTTAG 

4117.1 

ATGAAGAAAGTAAGATnTATTTTTTTAGCTCTGCTATTTTTC^ 

GTACnrGGCAAGGAAAACAGTATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGTTTTTGATACTCATT 
40 ATCAATCTTGGTTCTATATAAAAGCAGATGCTAACTATGCTGAAAATGAATGGCTAAAGCAAGGTGACGACTA 

TTACCTCAAATCTGGTGGCTATATGGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCTTGACCA^ 
GATGGAAAGATGAAAAGAAATGCTTGGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGAC 
TGGGTCTATGATTCTCAATACGATGCTTGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTC 
CAAATTAAAGGGAAGGACTATTATTTCAAATCCGGTGGTTATCTACTGACAAGTCAGTGGAT TAAT CAAGCTT 
45 TGAATGCTAGTGGTGCCAAAGTACAGCAAGGTTGGCTTTTTGACAAACAATACCAATCTTGGTT^ 
AAATGGAAACTATGCTGATAAAGAATGGATTTTCGAGAATGGTCACTATTATTATCTAAAATCCGGT 

GCAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATTTGATGGGAA^ 
TGGGTCTACGATTCTCATAGTCAAGCITGGTACTACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGA 

GGGATAAGGAATCTTGGTTTTATCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCT 
50 GTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCT^ 
TTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGG^ 
AAATCTGGTGGCTACATGGCGAAAAATGAGACAGTAGATGGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGA 

GGAAAAACTACAAATGAAAATGCTGCITACTATCAAGTAGTGCCTGTTACAGCCAATGT^ 

AAAAGCTTTCCTATATATCGCAAGGTAGTGTCGTATGGCTAGATAAGGATAGAAAAAGTGATGACAAGCGCTTGG 

55 CTATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGATTTACAAGCGCTAGATGCT 

CCCTTATTATGAGAGTGATGGCCACCGTTTTTATCACTATGTGGCrCAGAATGCTAGTATCCCAGTAGCTTCTCAT 

CITTCTGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCCTGCATTTTGATGGTTTTAAGC^ 

CCirCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGAAGAATTGGATA^ 
CATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAAGGAAGCCGAAGAACATTACCATATCAA 
60 TATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAATTGCCAAAGATAAGAATAATTTCTTTG 
GCATTACAGCCTATGATACGACCCCTTACCITTCrGCTAAGACATTTGATGATGTGGATAAGGGAATT^ 
AACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAGAACTTTCCTTGGAAACAAGGCTTCTGGTATGAATGT 
GGAATATGCTTCAGACCCTTATTGGGGCGAAAAAATTGCTAGTGTGATGATGAAAATCAATGAGAAGCTAGGTGG 

CAAAGATTAG 

65 
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4119.2 

ATGAAAAAAGTATTACAAAAATATTGGGCATGGGCTTTTGTGGTCATCCCCCTCTTGTTACAAGCAATi ri^ 
ATGTGCCGATGTTTCAAGGAGCCTTTTACAGTTTTACCAACTGGACAGGATTGACTTATAACTACAAATT^ 
CTTAAACAACTTTAAGCTCCTCTTCATGGATCCAAAATTCATGAATGCGATTGGCITTACC 

GCCATGGTGGTTGGTGAGATTGCACTCGGGATCTTCATTGCGCGTGTCTTGAATTCTAAAATCAAAGGCCAAACCT 
TCTTCCGTGCITGGTTCITCTTCCCAGCTGTTTTATCTGGTTTGACAG 

TACGGTCTTCCAGCGATTGGAAATGCCCTTCATATTGAATTTTTCCAAACCAGTCTTTTAGGGACTA^ 

CAATCTTTGCGGCTGTCTTTGTCCTTCTTTGGCAAGGGGTGGCTATGCCCATCATCATCrrCCTAGCT^ 

TCTATTCCAACTGAGATTACAGAGGCAGCAAGGATTGATGGTGCGACTAGCAAGCAAGTTTTCT GGAA CATTGAA 

TTGCCTTACTTGCTACCAAGTGTCTCTATGGTCTTTATCCTAGCCCTAAAAGGTGGGCTGACTGCCT^ 

CTTTGCCATGACCGGTGGTGGTCCAAACAATGCCACAACCTCACTTGGGCTCITGGTTTATAACT 

AACAACCAATTCGGTTATGCCAATGCCATTGCCGTAATCTTGTTCTTCTTAATTGTAGTGATTTCGATCATCC 

GAGAGTATCTAAGAAATTTGAAATTTAA 

4119.3 

ATGATGAAACAAGATGAAAGAAAAGCCCTGATTGGCAAATACATTCTATTGATTCTAGGATCG^ 

TGCCGCTCCITGCTACCCTCTTTAGTTCCITCAAACCCACTAAGGATATTGTAGATAA 

AACTTCACATGGGACAACTTTAGCCGTCTCITAGCrGATGGGATTGGAGGCTATTATTGGAACT 

CrGTCTTGTCTTTACTTGCAGTAATGATCTTTATCCCTATGGCAGCCTACTCC^ 

AAAAGCCTTTACCATCATGTATACCCTCTTAATCCTCGGAATCTTCGTACCIT^ 

CGGTTATGATGAGTAAACTCGGTTTGGCTAATACCTTTGGTTTGATCITGCTCT 

ACCCTCTTTCTCTATGTTGGCTATATCAAAATCTCGATTCCAGAAAGTCTGGATGAAGCAGCAGAGATCGAT 

CTAATCAATTTACAACCTATTTCCGCATCATCITCCCAATGATGAAACCGATGCATGCGACAACCATGATCATC^ 

TGCCCTTTGGTTCTGGAATGACTTCATGTTGCCACTCCTTGTCTTGAACCGGGATTCCAAAATGTG 

TGTTCCAATACAACTACGCAGGCCAATATTTCAACGACTACGGACCAAGCTTTGCCTCTTACGTGGTCGGCATTAT 

CAGTATCACCATTGTCTATCTCTTCTTCCAACGCCATATCATTTCAGGAATGAGCAACGGGGCAGTGAAGTAA 

4119.4 

ATGAAAAGTATTCTTCAGAAAATGGGGGAGCATCCGATGCTGCTTCTTTTTCTTAG CTATAGTACT 

TTCTTGCACAAAATTGGATGGGTCTTGTGGCTTCAGTAGGAATGTTTCTATTTACTAlM'rr 

TCGATTTTATCCCATAAATTCTTTCGATTGATTTTGCAGTTTGTCTTGTTTGGTAGT^ 

AGTTTAGAACATTTCCAAATTGTGAAGAAATTTAACTATGCTTTTC^ 

ACCGGGCAGAAGTGACCTTCTTTAATCCTAATTATTATGGAATTATTTGTTGTTTCTGT 

CTGTTTACAACGACCAAGTTGAATTGGTTGAAAGTATTCTGTGTGATTGCAGGCIT TGTTA ATCT 

CTTTACTCAAAATCGAACTGCCTTTCCTGCTATTATCGCTGGAGCAATTATCTATCTCTT^ 

GGAAGGCCTTTTGGCTTAGTATTGGGGTCTTCGCGATTGGTTTGAGTTTCCTCT^ 

ATGGGTACTTTAGACTCnTCTATGGAAGAACGCATTTCTATCTGGGATGCrGGGATGGCCTTG^ 

CITTTTGGGGTGAAGGGCCATTGACCTATATGAACTCTTATCCTCGGATACATGC TCCTT ATCATGAAC^ 

CAGTCTTTATATTGATACGATTCTGAGTTACGGAATTGTGGGGACTATTTTATT AGTT TTGTC^ 

TTCGCTTGATGATGGATATGAGTCAGGAGTCGGGGAAACGTCCGATTATCGGCCT TTAT CTATCTTTCCTTACAGT 

GGTTG CTGTG C ACGG AATTTTTG ACTTGGCTCTCnTCTGG ATTC AGTC AGGCTTT ATT^ 
GTATTCCATTGGAGCATCGAATGTTGGTATCGGACATGACGGATTAA 

4120.1 

ATGTCAAAGATGGATGTTCAGAAAATCATTGCACCGATGATGAAGTTTGTGAATATGCGTGGCATTATAGCTCTAA 
AAGATGGGATGTTAGCAATTTTGCCATTGACAGTAGTTGGTAGTTTGTTCITGATTATGGGACAATTGC^ 
AGGATTAAATAAGAGCATTGCTAGTGTTTTTGGAGCTAATTGGACAGAGCCGTITATGCAAGTATAT^ 
TTTGCTATTATGGGTCTAATTTCnTGTTTTTCAATTGCCTATTC^ 

AGCTGGAGTTCTATCTGTATCTGCATTCTTTATTTTGCTAAGATCATCTTATATCCCTAAACAAGG^^ 

GGGACGCTATTAGTAAAGTTTGGTTTGGAGGCCAAGGAATTATCGGTGCTATCATTATAGGTTTGGTAGTAGGAAG 

TATTTATACCTTCTTTATAAAGAGAAAAATTGTTATTAAGATGCCAGAACAAGTTCCACAAGCTATTGCCAAACAG 

TTTGAAGCAATGATTCCAGCATTTGTAATTTTCTTATCTTCTATGATTGTATATATTTTAGCG 

TGGCGGAACATTCATAGAAATGATTTATTCTGCTATTCAAGTTCCGTTGCAAGGTTTAACTGGATCTTTGT^^ 

GCTATTGGAATTGCATTCTTTATATCATTTTTGTGGTGGTTTGGTGTTCATGGGCAATCGGTAGTAAATG 

GACAGCTCTGCTTTTATCTAATCTTGATGCTAATAAAGCTATGTTAGCCTCTGCTAATCTATCATTAGAAAATGGT 

GCACATATTGTTACTCAACAATTTTTAGATTCATTTTTAATTCTATCAGGTTCAGGGATTACGT^ 

TGCCATGCTTTTTGCAGCAAAATCAAAACAATACCAAGCCITAGGAAAAGTTGCAGCTT^ 

GTAAATGAGCCAGTTGTATTTGGATTTCCGATTGTCATGAATCCAGTTATGTTTGTACCTTTCATTCTTG^ 

ACTTGCAGCTGTGATAGTATATGGAGCTATTGCAACAGGTTTCATGCAGCCATTCTCAGGGGTAACATTGCCTTGG 

AGTACACCAGCTATTTTATCAGGATTTTTGGTGGGTGGATGGCAAGGAGTTATTACrCAGCTGGTGATATTAGCGA 

TGTCTACATTGGTTTATTTTCCATTCTTTAAAGTACAGGATCGTTTAGCTTACCAAAATGAAATC/^ 

4121.2 

ATGAAGAAAAAGGACTTAGTAGACCAACTAGTCTCAGAGATCGAGACGGGGAAAGTCAGGACACTGGGAATATA 
CGGTCATGGAGCTTCAGGTAAATCAACCTTTGCACAGGAATTGTACCAAGCTTTAGATTCTACTACAGTAAATTTG 
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CTAGAGACAGATCCTTATATCACCTCAGGACGCCATCTGGTAGTACCCAAGGACGCGCCGAATCAAAAGGTGACA 
GCCAGTCTGCCAGTGGCGCATGAACTGGAGAGTTTGCAGAGAGATATCCTTgCTTGCAGGCGGGTATGGATGTCTT 

OA 



4122.1 

ATGAAGAAAAGATACCrAGTCTTGACAGCTTTGCTAGCCTTGAGTCTAGCAGCTTGTTCACAAGAAAAAAC^^ 

AATGAAGATGGAGAAACTAAGACAGAACAGACAGCCAAAGCTGATGGAACAGTCGGTAGTAAGTCTCAAGGAGC 

TGCCCAGAAGAAAGCAGAAGTGGTCAATAAAGGTGATTACTACAGCATTCAAGGGAAATACGATGAAATCATCGT 

AGCCAACAAACACTATCCATTGTCTAAAGACTATAATCCAGGGGAAAATCCAACAGCCAAGGCAGAGTTGGTCAA 

ACTCATCAAAGCGATGCAAGAGGCAGGTTTCCCTATTAGTGATCATTACAGTGGTTTTAGAAGTTATGAAACT 

ACCAAGCrCTATCAAGATTATGTCAACCAAGATGGAAAGGCAGCAGCTGACCGTTACTCTGCCCGTCCTGGCTAT 

AGCGAACACCAGACAGGCTTGGCCITTGATGTGATTGGGACTGATGGTGATTTGGTGACAGAAGAAAAAGC 

CAATGGCTCTTGGATCATGCAGCTGATTATGGCTTTGTTGTCCGTTATCTCAAAGGCAAGGAAAAGGAAACAGGCT 

ATATGGCTGAAGAATGGCACCTGCGTTATGTAGGAAAAGAAGCTAAAGAAATTGCTGCAAGTGGTCTCAGTT^ 

AAGAATACTATGGCTTTGAAGGCGGAGACTACGTCGATTAA 

4125.6 

ATGCGTAAATTCTTAATTATTTTGTTGCTACCAAGTTTTTTGACCATT^ 
TCGTCTATACirCGAAAGAAATTTATTACCTTTCACAATCTGACTTT 

CCCATGGTTTATGGAGAGGTTCCTGTTTATGCGAATGAAGATTTAGTAGTGGAATCTGGGAAATTGACTC 
CAAGTTTTCAAATAACCGAGTGGCGCTTAAATAAACAAGGAATTCCAGTATTTAAGCTATCA^ 
AGCTGCGGACAAACGATTTTTATATGATCAATCAGAGGTAACTCCAACAATAAAAAAAGTATGGTTAGAATCT 
CITTAAACTGTACAATAGTCCnTATGATTTAAAAGAAGTGAAATCATCCT^ 

GACAAGACCATGTTTGTAGAAGGAAGAGAATTTCTACATATTGATCAGGCTGGATGGGTAGCTA AAGA ATCAACT 

TCTGAAGAAGATAATCGGATGAGTAAAGTTCAAGAAATGTTATCTGAAAAATATCAGAAAGATTCT^ 

ATGTTAAGCAACTGACTACTGGAAAAGAAGCTGGTATCAATCAAGATGAAAAGATGTATGCAGCCAGCGT^ 

AACTCTCTTATCTCTATTATACGCAAGAAAAAATAAATGAGGGTCTTTATCAGTTAGAT^ 

ATCTGCAGTCAATGATTTTCCAGGTTCTTATAAACCAGAGGGAAGTGGTAGTCTTCCTAAAAAAGAA 

AGAATATTCTTTAAAGGATTTAATTACGAAAGTATCAAAAGAATCTGATAATGTAGCTCATAATCT 

TACATTTCAAACCAATCTGATGCCACATTCAAATCCAAGATGTCTGCCATTATGGGAGATGATTGGGATCCAAAAG 

AAAAATTGATTTCITCTAAGATGGCCGGGAAGTTTATGGAAGCTATTTATAATCAA/^ 

TTTGACTAAAACAGATTTTGATAGTCAGCGAATTGCCAAAGGTGTTTCTGTTAAAGTAG 

CTCATAAAATTGGAGATGCGGATGAATTTAAGCATGATACGGGTGTTGTCTATGCAGATTCrCCATTTATTCm 
ATTTTCACTAAGAATTCTGATTATGATACGATTTCTAAGATAGCCAAGGATGTTTATGAGGTTCT/^^ 

4125.7 

ATGAAAAAACAAAATAATGGTTTAATTAAAAATCCTTTTCTATGGTTATTATT^ 

CCAGTATTTCTATTCTGGGAATAACTCAGGAGGAAGTCAGCAAATCAACTATACTGAGTTGGTACAAGAAATTAC 

CGATGGTAATGTAAAAGAATTAACTTACCAACCAAATGGTAGTGTTATCGAAGTTTCTGGTGTCTATAAAAA^ 

AAAACAAGTAAAGAAGAAACAGGTATTCAGTTTTTCACGCCATCTGTTACTAAGGTAGAGAAATTTACCA 

ATTCTTCCTGCAGATACTACCGTATCAGAATTGCAAAAACTTGCTACTGACCATAAAGCAGAAGTAACTGT^ 

ATGAAAGTTCAAGTGGTATATGGATTAATCTACTCGTATCCATTGTGCCATTTGGAATTCTATTCITCT^ 

TCTATGATGGGAAATATGGGAGGAGGCAATGGCCGTAATCCAATGAGTTTTGGACGTAGTAAGGCTAAAGCAGCA 

AATAAAGAAGATATTAAAGTAAGATTTTCAGATGTTGCTGGAGCTGAGGAAGAAJUWVCAAGAACT 

GTTGAGTTCTTAAAAGATCCAAAACGATTCACAAAACnTGGAGCCCGTATTCCAGCAGGTGTTC^^ 

CTCCGGGGACAGGTAAAACTTTGCTTGCTAAGGCAGTCGCTGGAGAAGCAGGTGTTCCATT(^^ 

TTCTGACTTTGTAGAAATGTTTGTCGGAGTTGGAGCTAGTCGTGTTCGCTCTCm 

GCACCAGCTATCATCTTTATCGATGAAATTGATGCTGTTGGACGTCAACGTGGAGTCGGTCTCGGCGGAGGTAATG 

ACGAACGTGAACAAACCnTGAACCAACTTTTGATTGAGATGGATGGTTTTGAGGGAAATGAAGGGAT^ 

TCGCTGCGACAAACCGTTCAGATGTACTTGACCCTGCCCITTTGCGTCCAGGACGTTTTGATAGA^^ 

TGGTCGTCCTGATGTTAAAGGTCGTGAAGCAATCTTGAAAGTTCACGCTAAGAATAAGC 

CTTTAGCAGAAGATGTTGATTTGAAATTAGTGGCTCAACAAACTCCAGGCTTTGT^ 

CrTGAATGAAGCAGCriTAGTTGCTGCTCGTCGCAATAAATCGATAATTGATGC^ 

GATAGAGTTATTGCTGGACCTTCTAAGAAAGATAAGACAGTTTCACAAAAAGAACGAGAATTGGTTG(^ 

GAGGCAGGACATACCATTGTTGGTCTAGTCTTGTCGAATGCTCGCGTTGTCCATAAGGTTACAATTGTACCACGCG 

GCCGTGCAGGCGGATACATGATTGCACTTCCTAAAGAGGATCAAATGCTTCTATCTAAAGAAGATATGAAAGAGC 

AATTGGCTGGCTTAATGGGTGGACGTGTAGCTGAAGAAATTATCITTAATGTCCAAACCACAGGAGCTTCA^ 

ACTTTGAACAAGCGACACAAATGGCACGTGCAATGGTTACAGAGTACGGTATGAGTGAAAAACTTGGCCCAGTAC 

AATATGAAGGAAACCATGCTATGCTTGGTGCACAGAGTCCTCAAAAATCAATTTCAGAACAAACAGCTTATGAAA 

TTGATGAAGAGGTTCGTTCATTATTAAATGAGGCACGAAATAAAGCTGCTGAAATTATTCAGTCAAATCGTGAAAC 

TCACAAGTTAATTGCAGAAGCATTATTGAAATACGAAACATTGGATAGTACACAAATTAAAGCTCTTTACGAAAC 

AGGAAAGATGCCTGAAGCAGTAGAAGAGGAATCTCATGCACTATCCTATGATGAAGTAAAGTCAAAAATGAATGA 

CGAAAAATAA 
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4125.10 

atgagggaaccagattttttaaatcattttctcaagaagggatatttcaaaaagcatgctaaggcgg 

tttctggtggattagattccatgtttctatttaaggtattgtctacrratcaaaaagagttagagat^ 

ctagctcatgtgaatcataagcagagaattgaatcagattgggaagaaaaggaattaaggaagttggctgctgaa 

gcagagcttcctatttatatcagcaatttttcaggagaattttcagaagcgcgtgcacg/^ 

tttttcaagaggtcatgaaaaagacaggtgcgacagctttagtcactgcccaccatgctgatgatcagg^^ 

cgatttttatgcgcttgattcgaggaactcgcttgcgctatctatcaggaattaaggagaagcaa 

gatagaaatcattcgtcccttcttgcattttcagaaaaaagactttccatcaatttttcact^ 

aatcaggagaatcattattttcgaaatcgtattcgaaattcitacttaccagaattggaaa^ 

ttagggatgcaatcttaggcattggcaatgaaattttagattatgatttggcaatagctgaat^ 

taatgtggaagarrracagcagttattttcttacrctgagtctacacaaagagtttta 

gttttccagatttgaatcttacaaaagctcagtttgctgaagttcagcagattttaaa^ 

tcatccgattaaaaatggctatgaattgataaaagagtaccaacagtttcagatttgtaaaat cagtccgca ggct 

gatgaaaaggaagatgaacntgtgttacactatcaaaatcaggtagcttatcaaggatatttattct 

ttccattagaaggtgaattaattcaacaaatacctgtttcacgtgaaacatccatacacattcgtcatcgaa;^ 

aggagatgttttgattaaaaatgggcatagaaaaaaactcagacgtttatttattg 

ATTTGAAAATCCCTATGGAAAAGAGAAACTCTGCTCTTATTATTGAGCAATT^ 

AATTGCGACCAATAATTTGAGTAAAAAAACGAAAAATGATATAATGAACACTGTACTTTATATAG 

TAGGTAA 

4126.1 

ATGAAGCGTTCTTCTCirrTAGTTAGAATGGTTATTTCCATCITrCT 

ACTTTCTACTATCAATCAAGTTCITCAGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGAC^ 
ACTAGCCACTTTATTCAGTCTTATATCAAAAAACTAGAAACCACCrCGACTGGTT^ 

TGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTTTTGACCATCITGAAGT^ 

TAAGGACITGAAAACrGTTGTGCTGGTGACCAAATCTGGTCAGGTCATTTCTACAGATGACAGTGTGCAGATG/^ 

AACTTCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCATCAGGGAGCTATGCCTGTTT^ 

GCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAACTTGTTGATGCAAAGGGAGCCAATC^ 

TGCITCGTTTGGATATTTCTTATGAAACTCTGGAAGCCTATCTCAATCAACTCCAGTTG^ 

CTTCATTATCAATGAAAACCATGAATTTGTCTACCATCCTCAACACACAGTTTATAGTTCGTCTAG 

GCTATGAAACCCTACATCGATACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 

GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAGCTCTTGTGG 

ACCnTGCITGGGGCCAGTGTCACATCrCTTCnTGTCTGTCrcrrGCTTAGTC 
TCCriTGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCTTC^ 

GCCTATGAACTGAGAGAAGTAACrCGCCAATTTAATGCTATGTTGGATCAGATTGATCAGTTGATGGTAGCTAT^ 

GTAGCCAGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCITTCGAGCCAGATTAATCCACATTTCCTCTAT/^ 

CACTTTGGACACCATCATCTGGATGGCTGAATTTCATGATAGTCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCA 

ACCTATTTCCGCTTGGCGCTCAATCAAGGCAAGGACTTGATTTGTCTCTCTGACGAAATCAATC ATC 

ATCTCTTTATCCAGAAACAACGCTATGGAGATAAGCTGGAATACGAAATTAATGAAAATGTTGCCT^ 

AGTCTTACCCAAGCTGGTCCTACAACCCCTTGTAGAAAATGCTCTTTACCATGGCATTAAGGAAAAGGAAGGT^ 

GGGCCATATTAAACTTTCTGTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGTTGGCTTC 

CAAGATGCTGGTGATAGTAGTCAAAGTCAACrCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCTC 

AAACITCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGA^ 

ATAAATAGAATAGAAACTAGCTAA 



4126.7 

ATGAAGCGTTCTTCTCrriU'AGTTAGAATGGTTATTTCCATCTTTCTGGTCT^ 

ACTTTCTACrATCAATCAAGTTCnTCAGCCATTGAGGCCACCATrGAGGGCAACAGCCAAACGACCATCAGCCAG 

ACrAGCCACTTTATTCAGTCTTATATCAAAAAACTAGAAACCACCTCGACTGGTTTGACCCAGCAGACGGATG^ 

TGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTTTTGACCATCTTGAAGTCAGA 

TAAGGACTTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTCATTTCTACAGATGACAGTGTGCAGATGAA 

AACITCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCATCAGGGAGCTATGCCTGTTTTGACT 

GCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAACTTGTTGATGCAAAGGGAGCCAATCTTGGTG 

TGCITCGTTTGGATATTTCITATGAAACTCTGGAAGCCrATCTCAATCAACTCCAGTTGGGGC 

CirCATTATCAATGAAAACCATGAATTTGTCTACCATCCTCAACACACAGTTTATAGTTCGTCTAGCA^ 

GCTATGAAACCCTACATCGATACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 

GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAGCTCTTGTGG 

ACCITGCTTGGGGCCAGTGTCACATCTCTTCTTGTCTGTCTCTGCTTAGTGTGGTTCAGTCTTAAACGCTGGA^ 

TCCTTTGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCTTCGTGCCAAGGAAG^ 

GCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTGGATCAGATTGATCAGTTGATGGTAGCTATTC 

GTAGCCAGGAAGAAACGACCCGTCAGTACCAACnTCAAGCCCTTTCGAGCCAGATTAATCCACATTTCCTCTATAA 

CACITTGGACACCATCATCTGGATGGCTGAATTTCATGATAGTCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCA 

ACCTATTTCCGCTTGGCGCTCAATCAAGGCAAGGACTTGATTTGTCTCTCTGACGAAATCAATCATGTCCGCCAGT 

ATCTCTTTATCCAGAAACAACGCTATGGAGATAAGCTGGAATACGAAATTAATGAAAATGTTGCCTTTGATAAT^ 
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AGTCTTACCCAAGCTGGTCCTACAACCCCTTGTAGAAAATGCTCTTTACCATGGCATTAAGGAAAAGGAAGGTCA 

GGGCCATATrAAACTTTCXGTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGTTGGCTTC 

CAAGATGCTGGTGATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCITCAAAATGTCGATCAACGGCTC 

AAACTTCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGAAAGTTGA^ 

ATAAATAGAATAGAAACTAGCTAA 

4127.4 

ATGTTTTTTAAATTATTAAGAGAAGCTCTTAAAGTCAAGCAGGTTCGATCAAAAATTT^ 

TTTGGTCTTTCGTATCGGAACTAGCATTACAGTTCCTGGTGTGAATGCCAATAGCTTGAATGCTT^ 

CCTTCTTAAACATGTTGAGCTTGGTGTCGGGGAATGCCCTAAAAAACTTTTCGATTT^ 

CTATATCACCGCTTCTATTGTTGTCCAACTCTTGCAAATGGATATTTTACCCAAGTTTGTAGAG^^ 

GGGGAAGTAGGTCGAAGAAAATTGAATCAAGCTACTCGTTATATTGCTCTAGTTCTCGCITT^ 

GGATTACAGCTGGTTTTAATACCTTGGCTGGAGCTCAATTGATTAAAACTGCnTTAACT 

GATTGGTATCATCTTAACAGCTGGTAGTATGATTGTCACTTGGTTGGGTGAGCAAATTACAGATAAGGGATACGGA 

AACGGTGTTTCCATGATTATCTTTGCCGGGATTGTTTCCTCAATTCCAGAGATGATTCAGGGCATCTATGTG 

ACTTTGTGAACGTCCCAAGTAGCCGTATCACTTCATCTATCATTTTCGTAATCATTTTGATT 

ATTATTTACTTTACAACTTATGTTCAACAAGCAGAATACAAAATTCCAATCCAATAT ACT 

CTCCATCTAGCrCTTACCnTCCGTTAAAAGTAAACCCTGCTGGAGTTATCCCTGTTATC^ 

GCAGCGCCTGCGGCTATTCTTCAGTTTTTGAGTGCCACAGGTCATGATTGGGCTTGGGTAAGGGTAGC^ 

TGTTGGCAACTACTTCTCCAACTGGTATTGCCATGTATGCTrTGTTGATTATTCT 

TACAGATTAATCCrGAAAAAGCAGCAGAGACCTACAAAAGAGTGGTGCCrATATCCATGGAGTTCGTCCTGGTA^ 
AGGTACAGAAGAATATATGTCTAAACTTCTTCGTCGTCTTGCAACTGTTGGTTCCCTCr^ 



4127.5 

ATGGATATTAGACAAGTTACTGAAACCATCGCCATGATTGAGGAGCAAAACTTCGATATTAGAACCATTACCATG 

GGGATTTCTCTTTTGGACTGTATCGATCCAGATATCAATCGTGCTGCGGAGAAAATCTATCAAAAAATTACGA 

AGGCGGCTAATTTAGTAGCTGTTGGTGATGAAATTGCGGCTGAGTTGGGAATTCCTATCGTTAATAAGCGTGTA 

GGTGACACCTATTTCTCTGATTGGGGCAGCGACAGATGCGACGGACTACGTGGTTCTGGCAAAAGCGCTTGATAA 

GGCTGCGAAAGAGATTGGTGTGGACTTTATTGGTGGTTTTTCTGCCTTAGTACAAAAAGGT^ 

GAGATTCTCATCAATTCCATTCCTCGCGCTTTGGCTGAGACGGATAAGGTCTGCTCGTCAGTCAAT ATCG GCTCAA 

CCAAGTCTGGTATTAATATGACGGCTGTGGCAGATATGGGACGAATTATCAAGGAAACAGCAA ATCIT TCAGATA 

TGGGAGTGGCCAAGTTGGTTGTATTCGCTAATGCTGTTGAGGACAATCCATTTATGGCGGGTGCCTTTCATGGTGT 

TGGGGAAGCAGATGTTATCATCAATGTCGGAGTTTCTGGTCCTGGTGTTGTGAAACGTGCTTTGGAAAAAG 

GGACAGAGCTTTGATGTAGTAGCCGAAACAGTTAAGAAAACTGCCTTTAAAATCACTCGTATCGGTCAATTGGT^ 

GTCAAATGGCCAGTGAGAGACTGGGTGTGGAGTTTGGTATTGTGGACTTGAGTTTGGCACCAACCCCTGCGGTTGG 

AGACTCTGTGGCACGTGTCCTTGAGGAAATGGGGCTAGAAACAGTTGGCACGCATGGAACGACGGCTG CCTTG GC 

CCTCnTGAACGACCAAGTTAAAAAGGGTGGAGTGATGGCCTGCAACCAAGTCGGTGGTTTATCTGGTGCCT^ 

CCTGTTTCTGAGGATGAAGGAATGATTGCTGCAGTGCAAAATGGCTCTCTTAATTTAGAAAA^ 

CGGCTATCTGTTCTGTTGGATTGGATATGATTGCCATCCCAGAAGATACGCCTGCTGAAACTATTGCGGCT 

TGCGGATGAAGCAGCAATCGGTGTTATCAACATGAAAACAACAGCTGTTCGTATCATTCCCAAAGGAAAAGAAGG 

CGATATGATTGAGTTTGGTGGTCTATTAGGAACTGCACCCGTTATGAAGGTTAATGGGGCTTCGTCTGTCGAC^ 

ATCTCTCGCGGTGGACAAATCCCAGCACCAATTCATAGTTTTAAAAATTAA 

4128.1 

ATGACACAGATTATTGATGGGAAAGCTTTAGCGGCCAAATTGCAGGGGCAGTTGGCTGAAAAGACTGCAAA^ 

AAGGAAGAAACAGGTCTAGTGCCTGGTTTGGTAGTGATTTTGGTTGGGGACAATCCAGCCAGCCAAG^^ 

CGCAACAAGGAGAGGTCAGCCCITGCGGCTGGTTTCCGTAGCGAAGTAGTACGGGTTCCAGAGACCATTACTCAA 

GAGGAATTGTTAGACCTGATTGCTAAATACAATCAGGATCCAGCTTGGCATGGGATTTTGGTTCAGTTGCCAT^ 

CAAAACACATTGATGAAGAGGCGGTTCTATTGGCTATTGACCCAGAAAAGGATGTGGATGGTTTCCATCCTCTAA 

ACATGGGGCGTCTTTGGTCTGGTCATCCAGTCATGATTCCnTCGACACCGGCAGGAATTATGGAAATGTTCCATGA 

ATATGGGATTGACTTGGAAGGTAAAAATGCAGTCGTCATCGGTCGATCCAATATTGTCGGAAAACCTATGGCCCA 

GCTTCTTTTGGCAAAGAATGCAACAGTAACCITGACTCACTCACGTACTCATAATCTTTCCAAGGTGGCT 

GCAGATATTCTGGTTGTTGCAATCGGTCGTGCCAAGTTTGTGACTGCTGACITTGTCAAACCAGGTGCGGTAGTCA 

TTGACGTTGGGATGAACCGCGATGAAAATGGTAAGCTCTGTGGGGATGTTGATTATGAGGCGGTTGCCCCACTTGC 

TAGCCACATTACGCCAGTCCCTGGAGGTGTCGGTCCTATGACCATTACTATGCTGATGGAGCAAACCTATCAGGCA 

GCACTTAGGACATTGGATAGAAAATAA 

4128.2 

ATGTCTAAATTTAATCGTATTCATTTGGTGGTACTGGATTCTGTAGGAATCGGTGCAGCACCAGATGCTAATAACT 

TTGTCAATGCAGGGGTTCCAGATGGAGCTTCTGACACACTGGGACACATTTCAAAAACAGTTGGTTTGAATGTCCC 

AAACATGGCTAAAATAGGTCTTGGAAATATTCCTCGTGAAACTCCTCTTAAGACTGTAGCAGCTGAAAGCAATCC 

AACTGGATATGCAACAAAATTAGAGGAAGTATCTCTTGGTAAGGATACTATGACTGGACACTGGGAAATCATGGG 

ACTCAACATTACTGAGCCTTTCGATACrrrCTGGAACGGATTCCCAGAAGAAATCCTGACAAAAATCGAAGAAT^ 

TCAGGACGCAAGGTTATTCGTGAAGCCAACAAACCTTATTCAGGAACGGCTGTTATCTATGATTTTGGACCACGTC 



SUBSTITUTE SHEET (RULE 26) 



wo 00/06737 



PCT/GB99/02451 



40 



AGATGGAAACTGGAGAGTTGATTATCTATACrrCAGCTGACCCTGTTTTGCAGATTGCTGCCCACGAAGACATTA^ 

TCCTTTGGATGAATTGTACCGTATCTGTGAATACGCrCGTTCGATTACCCTTGAGCGTCCTGCCCr^ 

TCATTGCrCGCCCITATGTAGGTGAACCAGGTAACTTCACrCGTACGGCAAACCGTCGTGAC^ 

ATTTTTCCCAACTGTTTTGGATAAATTGAATGAGGCTGGTATCGATACITATGCTGTGGGTA>^^ 

TTTAACGGTGCTGGTATCAACCATGACATGGGTCACAACAAGTCAAATAGTCATGGAATTGATACACTATTGAAG 

ACTATGGGACTTGCTGAGTTTGAAAAAGGATTCTCATTCACAAACCTAGTTGACTTTGATGCCCT^ 

GTCGTAATGCTCACGGTTACCGTGATTGCTTGCATGAGTTTGATGAACGCTTACCTGAAATTATCGCAGCr 

AGAGAATGACCTTCTCTTGATTACTGCGGACCATGGAAATGACCCAACGTATGCAGGAACGGATCACACTCGGGA 

ATATATTCCATTGTTGGCCTATAGCCCTGCCTTTAAAGGAAATGGTCTCATTCCAGTAGGACATTTTGCAGATAT^ 

CAGCGACTGTTGCCGATAACTTTGGTGTGGAAACTGCTATGATTGGGGAAAGTTTCTTAGATA^ 

4129.2 

ATGTTTATTTCCATCAGTGCTGGAATTGTGACATTTTTACTAACTTTAGTAGA^ 

TAGAAAGGCGCAAATTACAGGCCAGCAGATGCATGAGGATGTCAAACAGCATCAGGCAAAAGCTGGGACTCCTA 
CAATGGGAGGTTTGGTTTTCTTGATTACTTCTGTTTTGGTTGCTTTC^^ 

ATAATGTGGGAATGATTTITGTTCATCTTGGTCITGTATGGCTTGGTCGGATTTTTAGATGAC^^ 
CGTAAAATCAATGAGGGGCTTAATCCTAAGCAAAAATTAGCTCTTCAGCTTCTAGGTGGAGTTATC^ 
TCTATGAGCGCGGTGGCGATATCCTGTCTGTCTTTGGTTATCCAGTTCATTTGGGATTTTTCTATAT^^ 
CITTTCTGGCTAGTCGGTTTTTCAAACGCAGTAAACTTGACAGACGGTGTTGACGGT^ 

GATTAGTTTGTCTGCCTATGGAGTTATTGCCTATGTGCAAGGTCAGATGGATATTCTTCTAGTGATTCTTGCCATGA 
TTGGTGGTTTGCTCGGTTTCTTCATCTTTAACCATAAGCCTGCCAAGGTCTT^ 

CTAGGTGGGATGCTGGCAGCTATCTCTATGGCTCTCCACCAAGAATGGACTCTCTTGATTATCGGAATTGTGTATG 

TTTTTGAAACAACTTCTGTTATGATGCAAGTCAGTTATTTCAAACTGACAGGTGGTAAACGTAT^^ 

GCCTGTACATCACCATTTTGAGCTTGGGGGATTGTCTGGTAAAGGAAATCCTTGGAGCGAGTGGAAGGTTGACTTC 

TTCTTTTGGGGAGTGGGACTTCTAGCAAGTCTCCTGACCCTAGCAATTTTATATl^ 
4133.1 

TTGTTTAAGAAAAATAAAGACATTCTTAATATTGCATTGCCAGCTATGGGTGAAAA(ni^^ 

GAATGGTGGACAGTTATTTGGTTGCTCATTTAGGATTGATAGCTATTTCAGGGGTTTCAGTAGCT 

CACCATTTATCAGGCGATTTTCATCGCTCTGGGAGCTGCTATTTCCAGTGTTATTTCAAAAAGCAT^ 

GACCAGTCGAAGTTGGCCTATCATGTGACTGAGGCGTTGAAGATTACCTTACTATTAAGTTTCCTTTTAGGA 

GTCCATCITCGCTGGGAAAGAGATGATAGGACTTTTGGGGACGGAGAGGGATGTAGCTGAGAGTGGTGGACTGTA 

TCTATCTTTGGTAGGCGGATCGATTGTTCTCTTAGGTTTAATGACTAGTCTAGGAGCCTTGATTCGTC 

AATCCACGTCTGCCTCTCTATGTTAGTTTTTTATCCAATGCCTTGAATATTCnTTTT^ 

CTGGATATGGGGATAGCTGGTGTTGCTTGGGGGACAATTGTGTCTCGTTTGGTTGGTCTTGTGATT^ 

AATTAAAACTGCCTTATGGGAAGCCAACTTTTGGTTTAGATAAGGAACTGTTGACCTT^ 

AGAGCGACnTATGATGAGGGCTGGAGATGTAGTGATCATTGCCTTGGTCGTTTCTTTTGGGACGGAGGCAGTTGCT 

GGGAATGCAATCGGAGAAGTCTTGACCCAGTTTAACTATATGCCTGCCTTTGGCGTCGCTACGGCAACGGTCATGC 

TGTTGGCCCGAGCAGTTGGAGAGGATGATTGGAAAAGAGTTGCTAGTTTGAGTAAACAAACCTTTTGGCT^ 

GTTCCTCATGTTGCCCCTGTCCTTTAGTATATATGTCTTGGGTGTACCATTAACTCATCTCTATACGACTGAT^ 

TAGCGGTGGAGGCTAGTGTTCTAGTGACACTGTTTTCACTACTTGGGACCCCTATGACGACAGGAACAGTCATCTA 

TACGGCAGTCTGGCAGGGATTAGGAAATGCACGCCTCCCTTTTTATGCGACAAGTATAGGAATGTGGTGTATCCGC 

ATTGGGACAGGATATCTGATGGGGATTGTGCTTGGTTGGGGCTTGCCTGGTATTTGGGCAGGGTCTCTCTTGGATA 

ATGGTTTTCGCTGGTTATTTCTACGCTATCGTTACCAGCGCTATATGAGCTTGAAAGGATAG 

4135.2 

ATGCAAACTCAAGAAAAACACTCGCAAGCAGCCGTTCTTGGCTTGCAGCACTTACTAGCCATGTACT 

ATCCTGGTTCCCATCATGATTGCGACAGCCCITGGCTATTCAGCTGAGCAGTTGACCTACCTGATT^ 

TCTTCATGTGTGGGGTGGCAACCTTCCTCCAACTCCAACTCAACAAATACTTTGGGATTGGACrCCCAGTCG^ 

TGGAGTTGCATTCCAGTCGGTCGCTCCCrTGATTATGATTGGGCAAAGCCATGGTAGTGGCGCTATGTT^ 

CTTATCGCATCTGGGATTTACGTGGTTCTTGTTTCAGGCATCITCrrCAAAAGTAGCCA^ 

AACAGGATCTGTTATTACCACGATTGGTTTAACCTTGATCCCTGTCGCTATTGGAAATATGGGAAATAACGTTCC^ 

GAGCCAACTGGTCAAAGTCTCTTGCTTGCAGCTATTACTGTTCTGATTATCCTCnTGATCAACATC^^ 

GATTTATCAAGTCTATCTCTATTTTGATTGGTCTGGTTGTTGGAACTGCCATTGCTGC^ 

TTCTCTCCTGTTGCGGTAGCTCCACnTGTCCATGTCCCAACTCCACTCTACTTTGGGATGCCAACCTT^ 

ATCTATTGTCATGATGTGTATCATCGCAACGGTGTCTATGGTTGAGTCAACTGGTGTTTATCTGGCC^ 

TCACAAAGGATCCAATCGACAGCACGCGCCTTCGCAACGGATACCGCGCAGAAGGTTTGGCCGTACTTCTCGGAG 

GAATCTTTAACACCTTCCCrTACACCGGATTTTCACAAAACGTTGGTTTGGTTA^ 

CCTGCCAATCTACTACGCAGCTGGTTTCCTGGTTCTCCnTGGACrGCTTCCTAAGT^ 

TTCCAAGCTCCGTCCTCGGTGGTGCCATGCTGGTAATGTTTGGTTTTGTATCAATTCAAGGGATGCAAATCCTCGC 
CCGTGTTGACriTGCTAACAATGAACACAACTTCCTTATCGCAGCTGTTTCAATCGCTGCAGGTGTCG 
AACAGTAATCTCTTTGTCAGCATGCCGACAGCCITCCAAATGTTCTTCTCAAACGGAATCGTCGTAGCCAGCCTAC 
TCGCTATTGTCCTCAATGCCGTATTAAATCATAAAAAGAAATAA 

4136.2 
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ATGAAAGATAGAATAAAAGAATATTTACAAGACAAGGGAAAGGTGACTGTTAATGATTTGGCTCAGGCI^ 
AAAGACAGTTCCAAGGATTTTCGTGAGTTGATTAAAACCITGTCCITAATGGAA 

AAGAAGATGGTAGTCrGACATTAGAAATTAAGAAAAAACATGAGATTACCCTCAAGGGGATTTTTCATGCCCATA 
AAAATGGCTTTGGCITTGTTAGTCTGGAAGGCGAGGAGGACGACCTTTTTGTAGGG^^^ 
5 TATTGATGGTGATACCGTCGAGGTAGTGATTAAGAAAGTCGCTGACCGCAATAAGGGAACAGCAGCAGAAGCCAA 
AATTATTGATATCCTAGAACACAGTTTGACAACAGTTGTCGGGCAAATCGTTCTGGATCAGGAAAAACCTAAGTAT 
GCTGGCTATATTCGTTCAAAAAATCAGAAAATCAGTCAACCGATTTATGTTAAGAAACCAGCCCTAAAATTAGAA 
GGAACAGAAGTTCTCAAAGTCTTTATCGATAAATACCCAAGCAAGAAACATGATTTCTTTGTCGCGAGTGTTCT 
ATGTAGTGGGACACTCAACGGATGTCGGAATTGATGTTCITGAGGTCTTGGAATCAATGGACATTGTATCCGAGTT 

10 TCCAGAAGCTGTTGTTAAGGAAGCAGAAAGTGTGCCTGATGCTCCGTCTCAAAAGGATATGGAAGGTCGTCTGGA 
TCTAAGAGATGAAATTACCTTTACCATTGACGGTGCGGATGCCAAGGACTTGGACGATGCAGTGCATATCAAGGC 
TCTGAAAAATGGCAATCTGGAGTTTGGGGTTCACATCGCAGATGTTTCTTATTATGTGACCGAGGGGTCTGCCC^ 
GACAAGGAAGCCCnTAACCGTGCGACTTCTGTTTACGTGACAGACCGAGTGGTGCCAATGCITCCAGA^ 
TCAAATGGCATCTGCTCTCTCAATCCCCAAGTTGACCGCCTGACCCAGTCTGCTATTAT 

15 GGAGATTGATAAACATGGTCGTGTGGTCAACTATACCATTACACAAACAGTTATCAAGACCAGTTTTCGTATGACC 
TATAGCGATGTCAATGATATCCTAGCTGGCGATGAAGAAAAGAGAAAAGAATATCATAAAA TTGTA TCAAGTATC 
GAACTCATGGCCAAGCirCATGAAACTTTAGAAAACATGCGTGTGAAACGTGGAGCTCT 
AAGCGAAGATTTTAGTGGATAAACAAGGTAAGCCTGTTGATATCGTTCITCGGCAGCGTGGTATT GCCGA G 
TGATTGAGTCTTTTATGTTGATGGCTAATGAAACAGTTGCCGAACATTTCAGCAAGTTGGAl^ 

20 CGAATTCACGAGGAGCCTAAGGCTGAAAAGGTTCAGAAGTTTATTGATTATGCTTCGAGTTTTGGC^ 

ATGGAACTGCCAGTGAGATTAGTCAGGAGGCACTTCAAGACATCATGCGTGCTGTTGAGGGAGAACCTTATGCAG 
ATGTATTGTCCATGATGCTTCnTCGCTCTATGCAGCAGGCTCGTTATTCGGAGCACAATCACGGCCACTATGG 
AGCTGCTGACTATTATACTCACITTACCAGTCCAATTCGTCGTTATCCAGACCTTCTTGT^ 
ATTACGGCCGTTCTAAGGAAATAGCAGAGCATTTTGAACAAGTGATTCCAGAGATTGCGACCCAGTCITC 

25 GTGAACGTCGTGCCATAGAAGCTGAGCGTGAAGTCGAAGCCATGAAAAAGGCTGAGTATATGGAAGAATACGTGG 
GTGAAGAGTATGATGCAGTTGTATCAAGTATTGTCAAATTCGGTCTCTTTGTCGAATTGCCAAACACAGTTGAAGG 
CTTGATTCACATCACTAATCTGCCTGAATTTTATCATTTCAATGAGCGTGATTTGACrc^ 

GGTATCACTTTCCGAGTGGGTCAGCAGATCCGTATCCGTGTTGAAAGAGCGGATAAAATGACTGGAGAGATTGAT 
TTTTCATTCGTACCTAGTGAGTTTGATGTGATTGAAAAAGGCTTGAAACAGTCTAGTCGT 
30 AGTGGCAGAGGGCGTGATTCAAATCGTCGTTCGGATAAGAAGGAAGAC AAGAG AAAATCAGGACGCTCAAATGA 
TAAGCGTAAGCATTCACAAAAAGACAAGAAGAAAAAAGGAAAGAAACCITTTTACAAGG 
GAGCCAAGCATGGCAAAGGGCGAGGGAAAGGTCGTCGCACAAAATAA 

4137.2 

35 ATGGGCACAACAGGATTTACAATAATTGACTTAATTATCTTGATTGTTTATTTAC^ 

CTATTTCTCTAAAAAAGAGATGAAAGGAAAAGAGTTCTTTAAAGGAGATGGTTCGGTTCC^ 
GTATCCATTTTTGCCACAATGCTCAGTCCGATTTCCTTCTTGGGACTCGCTGGTAGCrCT^ 
TTTATGGTTTGCTCAATTAGGGATGGTAGTAGCTATTCCACTGACAATTCGTTTTATC^ 
TAGACATCGATACGGCATATGATTACTTGGATAAACGTTTTAATTCTAAAGCACTTCGTATTATTTC^ 

40 GTTTATTATTTATCAATTGGGACGTATGTCTATCATTATGTACCTCCCATCAGCTGGTTTATCAGTATTGACAG 
TTGACATCAATATTTTGATTATTTTGATGGGTGTAGTTGCAATTGTTTATTCTTAT^ 
TTATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTTTAGCnTTATTTGTACT 
TAAAGGTGGCTTTGGTGCAGTAGCAGAAACATTAGCAAACGGGAAATTCCITGCTGCAAATGA^ 
TCCTAACTTGCITTCAAACTCCATCTTTTTAATTGTGATGGGTTCAGGC^ 

45 TCAAGATTTGGTTCAACGTTrrACTACAACACAAAATATTAAGAAACTTAATA^ 

TTGTCACTTGCAACTGCAACAGTCTTTTACTTGATTGGTACAGGCnTGTACGTATTCT 

ATAGTGCAGCTAGCAATATCCCTCAAGACCAAATCTTTATGTACXrrATTGCATACCAGTTACCAGTAGGTATCAC 
AGGTTTGATCrTGGCAGCGATTTATGCAGCATCTCAATCAACTATTTCAACAGGTTT^ 

TGGACATTGGATATTCAAGATGTCATTTCTAAAAATATGTCAGACAATCGTCGTACGAAAATTGCACAATTCGTAT 
50 CTCTAGCAGTAGGTTTATTCTCAATTGGTGTTTCCATTGTCATGGCTCACTCAGATATTAAATCTGCATAC 
TTCAATAGTTTCATGGGACTTGTACTTGGTCTACTTGGTGGTGTATTTATTCTTGGATT^ 

TAAACAAGGTGCTTATGCAGCGCTGATTGTATCAACCATCGTCATGGTATTTATTAAATACTTCCTTCCTCCAACA 
GCrGTTAGCrACTGGGCATATTCATTGATTTCAATCrcrGTATCAGTAGTTTCAGGTTATATTGTATCTG 
TGGAAATAAAGTATCTGCACCTAAATATACAACGATTCATGATATTACAGAAATTAAAGCGGATTCAAGTTGGGA 
55 AGTTCGTCACTAA 

4138,1 

ATGAAATTTAGTAAAAAATATATAGCAGCTGGATCAGCrGTTATCGTATCCTTGAGTCTATGTGCCTATGCACTi^ 
ACCAGCATCGTTCGCAGGAAAATAAGGACAATAATCGTGTCTCTTATGTGGATGGCAGCCAGTCAAGTCAGAAAA 

60 GTGAAAACTTGACACCAGACCAGGTTAGCCAGAAAGAAGGAATTCAGGCTGAGCAAATTGTAATCAAAATTACAG 
ATCAGGGCTATGTAACGTCACACGGTGACCACTATCATTACTATAATGGGAAAGTTCCTTATGATGCCCTCTTTAG 
TGAAGAACrCTTGATGAAGGATCCAAACTATCAACTTAAAGACGCTGATATTGTCAATGAAGTCAAGGGTGGTTA 
TATCATCAAGGTCGATGGAAAATATTATGTCTACCTGAAAGATGCAGCTCATGCTGATAATGTTCGAACTAAAGAT 
GAAATCAATCGTCAAAAACAAGAACATGTCAAAGATAATGAGAAGGTTAACTCTAATGTTGCTGTAGCAAGGTCT 

65 CAGGGACGATATACGACAAATGATGGTTATGTCTTTAATCCAGCTGATATTATCGAAGATACGGGTAATGCTTATA 
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TCGTTCCTCATGGAGGTCACTATCACTACATTCCCAAAAGCGATTTATCTGCTAGTGAATTAGCAGCAGCTAAAGC 

ACATCTGGCTGGAAAAAATATGCAACCGAGTCAGTTAAGCTATTCTTCAACAGCTAGTGACAATAACACGCAAT^ 

TGTAGCAAAAGGATCAACTAGCAAGCCAGCAAATAAATCTGAAAATCTCCAGAGTCTTTTGAAGGAACTCT 

TTCACCTAGCGCCCAACGTTACAGTGAATCAGATGGCCTGGTCTTTGACCCTGCTAAGATTATCAGTCGTACACCA 

AATGGAGTTGCGATTCCGCATGGCGACCATTACCACITTATTCCTTACAGCAAGCTTTCTGCC^ 

TTGCCAGAATGGTGCCTATCAGTGGAACTGGTTCTACAGTTTCTACAAATGCAAAACCTAATGAAGTAGTGTC^ 

TCTAGGCAGTCTTTCAAGCAATCCTTCTTCTTTAACGACAAGTAAGGAGCTCTCTTCAGCAT 

TTAATCCAAAAGATATCGTTGAAGAAACGGCTACAGCTTATATTGTAAGACATGGTGATCATTTCCATTACATTC 

AAAATCAAATCAAATTGGGCAACCGACTCTTCCAAACAATAGTCrAGCAACACCTTCTCCATCTC^ 

CCAGGAACTTCACATGAGAAACATGAAGAAGATGGATACGGATTTGATGCTAATCGTATTATCGCTGAAGATGAA 

TCAGGTTTTGTCATGAGTCACGGAGACCACAATCATTATTTCTTCAAGAAGGACTTGACAGAAGAGCAAATTAAG 

GTGCGCAAAAACATTTAG 

4139.1 

ATGAAAAAAAGAGCAATAGTGGCAGTCATTGTACTGCTTTTGATTGGGCTGGATCAGTTGGTCAAATCCT 

TCCAGCAGATTCCACTGGGTGAAGTGCGCTCCTGGATCCCCAATTTCGTTAGCITGACCTACCrGCAAj^ 

TGCAGCCrTTTCTATCTTACAAGATCAGCAGCTGTTATTCGCTGTCATTACTCT 

GGTATTTACATAAACACATGGAGGACTCATTCTGGATGGTCITGGGTTTGACTCTA^ 

AAACTTTATTGACAGGGTCAGTCAGGGCTTTGTTGTGGATATGTTCCACCTTGACn^ 

ATGTGGCAGATAGCTATCTGACGGTTGGAGTGATTATTTTATTGATTGCAATGCTAAAAGAGGAAATAAATGGAA 
ATTAA 

4139,5 

ATGAATACAAATCITGCAAGTTTTATCGTTGGACTGATCATCGATGAAAACGACCGTTT^ 

ATGGTCAAACCTATGCrCTTGCTAAGGAAGAAGGCCAACATACAGTAGGGGATACGGTCAAAGGTTTTGCATAC^ 

CGGATATGAAGCAAAAACTCCGCCTGACAACCTTAGAAGTGACTGCCACTCAGGACCAATTTGGTTGGGGACGTG 

TCACAGAGGTTCGTAAGGACTTGGGTGTCTTTGTGGATACAGGCCTTCCTGACAAGGAAATCGTTGTGTCACTCGA 

TATTCTCCCTGAGCTCAAGGAACTCTGGCCTAAGAAGGGCGACCAACTCTACATCCGTCTTGAAGTGGATAAGAA 

AGACCGTATCrGGGGCCTCTTGGCTTATCAAGAAGACITCCAACGTCTTGCTCGTCCTGCCTACAACAA 

AACCAAAACTGGCCAGCCATTGTTTACCGTCTCAAGCTGTCAGGAACTTTTGTTTACCTACCAGA 

TTGGTTTTATTCATCCTAGCGAGCGTTACGCAGAGCCACGTTTGGGGCAAGTATTAGATGCGCGCGTTATTGGT^ 

CCGTGAAGTGGACCGCACTCTGAACCrCTCCCrCAAACCACGCrCCTTTGAAATGTTGGAA 

ATTTTGACTTATTTGGAAAGCAATGGCGGTTTCATGACCTTAAATGACAAGTCATCrrCCAGACGACATC/^ 

CCTTTGGCATTrCTAAAGGTCAGTTCAAGAAAGCTTTAGGTGGTCTTATGAAGGCTGGTAA^ 

GTTTGGGACAGAGTTGATTTAG 

4139.8 

ATGAAAGATGTTAGTCTATTTTTATTGAAAAAAGTTTTCAAAAGCCGCTTAAA 

ATCTGTACTCGGTGTTACCTTTTATTTAAATAGTCAGACTGCAAACTCACACAGCTTGGAGAGCAGGT^ 

CGCATTGCAGCCAACGAGAGGGCTATCAATGAAAATGAAGAGAAACTCTCCCAAATGTCTGATACCAGC TCGGA G 

GAATACCAGTTTGCTAAAAATAATTTAGACGTGCAAAAAAATCITTTGACGCGAAAGACAG^^ 

TAAAAGAAGGGCGCTGGAAAGAAGCCTACTATTTGCAGTGGCAAGATGAAGAGAAGAATTATGAATTTGTA^^ 

ATGACCCGACTGCTAGCCCTGGCTTAAAAATGGGGGTTGACCGCGAACGGAAGATTTACCAAGCCCT 

TGAACATAAAAGCACATACITTGGAGTTTCCGACCCACGGGATTGATCAGATTGTCTGGATTT^ 

CCCTVAGTTTGTTTGTGGTTGCTATTATTTTTATGCTAACACAACT 

ACAGCTCACTTATATCCTGTTTCAAAAGTGACATTTGCAATATCCTCTCnTGGAGTTGGA 

TGCTGTTTATCGGAATCTGTGGCrrrTCTTTTCTAGTGGGAAGTCTGATAAGTGGTT^ 

TACCCAATTTATAGCTTAGTGAATCAAGAAGTAACTATTGGGAAAATACAAGATGTATTATT^ 

TAGCnrrCTTAGCCnTTATCGTCATTGTGGAAGTTGTGTACTTGATTGCTTAC^^ 

CTCTTTCTTrCACTCATTGGGATTGTTGGCTTATTGTTTGGTATCCAAACCATTC 

TCTGATTCCCnTTACTTACTTGCGTTCAGTGGAGATTTTATCTGGAAGATTACCTAAGCA 

TAAATTGGAGCATGGGAATGGTCTTACTTCCTTGCCrGATTATCTTTTTGCTAT^ 

TGGGGAAGTTCACAGAAAAAAGAATTTTTTAATAGATTCTAG 

4141.1 

ATGATGAAGTTCATATTGGATATTGTTAGTACACCAGCTATTTTAGTAGCnTTAATTGCAATCT^ 

TCAGAAGAAGAAATTACCTGATATTATTAAAGGTGGAATTAAGACCTTTGTTGGTTTCTTAGT^ 

GCAGGAATTGTACAAAATTCITTAAATCCATTTGGTACCATGTTTGAGCATGCTT^ 

GAATAATGAAGCAATTGTAGCTGTAGCTTTAACAACATATGGCTCAGCTACTGCAATGATTATGTTTGCAGGCATG 

ATGTATGATTGCGGTCATTTTATCAGTTGCTGGCTTTACTAGCITGCCTCTCATCTTACTAGGAGGATTAGCA 
GTATTATTATGAGTATTTCCCCAGCATTTGTGCAAAAATATATGGTTCAATTAACTGGAAATGACAAGGTAGCTTT 
AGGTCATTTCAGTTCTTTGGGATATTGGTTGAGTGGTTTTACTGGTAGCCTTATCGGTGACAAATC^ 
GAGGACATTAAATTTCCAAAGAGTTTAGCTTTTTTACGTGATAGTACTGTTAGTATTACT^ 

TTACATTATTGTAGCTATCTTTGCAGGGTCAGAATATATAGAAAAAGAAATCAGTAGTGGTACAAGTGGTCTAGTT 
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TATGCTTTACAATTAGCAGGTCAATTTGCAGCAGGGGTATTTGTTATTTTAGCAGGTGTTCGCCITAT^^ 

AATTGTTCCAGCCITTAAAGGTATTTCAGAGCGTCTTGTACCTAATTCAAAACCTGC^ 

ATACTTATGCACCCAATGCAGTTCTAATTGGATTTATCTCTAGTTTTGTTGGTGGTTTAGTAAGTA^ 

ATTGCTTCAGGAACGGTTGTTATCTTACCAGGTGTTGTGCCTCATTTCITCTGTGGAGC^ 

GAATGCATCTGGTGGTGTTCGTGGAGCCACTATTGGAGCATTTTTACAAGGTATTTTAATCAGT^ 

TTTTAATGCCAGTTTTGGGAGGACTTGGTTTCCAAGGATCAACTTTCTCAGATGCAGATT^ 

ATTTTAGGAATGTTAAATCAATTTGGCTCACAAGCAGGCATTGTGATTGGTCITGTTC^ 

TGGAGTATCCTTTATTAAAAAGCCATCTGCAACGGAGGAATAA 

4142.3 

ATGATTAAAACATTTCTCTCTGCCCTTTCGGTCATTCrCTTTTCTA^ 

TCTAATCTTAACATTTGGCTATCTACCCAACCTATCTTGGCACAGATTTATGCCTTCCCCTT 
GGCTGCTATTTTAAGTTTCTTATTITTTTTCCTATCTTT^ 

TTTTGCTCTTACTATCGCTCATATTACTATTATTCGGAACAGATAAAACCCTTTCTTCT 

ACCrTAAAATTAGTAACTTGGAACGTCGCTAATCAAATAGAAGCACAACATATrGAGCGAATT^^ 

ACGCCGATATGGCTATATTCCCTGAACTAGCTACCAATATCAGAGGTGAGCAAGAAAACCAGAGAATCAAACTAT 

TGTTTCATCAAGTTGGACTTTCTATGGCCAACTATGATATTTTCACnTCTCCACCT 

GTGACTGTGATTGTCAAGAAAAGTTATGGTTTCTATACAGAAGCTAAAACTTTTCATA^ 

TTGTATTACATTCGAGAAAACAAAATATACCAGATATCATTGCCTTGCATACTGCGCCTCCTCTGCCAGG 

GGAAATCrGGAAGCAAGACTTAAACATCATTCATAATCAATTGGCTTCAAAATATCCAAAG^ 

GATTTTAATGCAACTATGCGTCATGGAGCACTTGCAAAAATAAGCTCrCATAGGGACGCAT^ 

CCTTTTGAAAGAGGAACTTGGAATAGCCAAAGTCCAAAACTTTTTAATGCA^ 

AAAACCACTACTATGTTAAAGATTTAGACATTGTAAGTTTTCAAAACTCTGATC^ 

CACATTTTAA 

4142.4 

ATGAATCCAATCCAAAGATCTTGGGCTTATGTCAGCAGAAAGCGACTGAGAAGTTTTATTTTAT^ 

TGGTCTTATTGGCCGGAATTTCAGCCTGTTTGACTCTGATGAAGTCCAACAAAACAGTAGAAAGCAATC^^ 

ATCACrCAATACATCTTTTTCTATTAAGAAGATAGAGAATGGTCAGACATTCAAGTTGTCAGACCTAGCATCT 

AGCAAGATTAAGGGGCTGGAAAATGTCTCTCCTGAACirGAGACGGTCGCAAAACTAAAAGACAAG^ 

ACTGGCGAGCAGAGCGTGGAGCGTGATGATTTATCAGCTGCAGACAATAACTTGGTTAGCnTAACGGCTCrTGAG 

GATTCATCCAAGGATGTAACCTTTACCAGTTCGGCITTCAATCTAAAAGAAGGGCGACACC^ 

TCCAAGAAAATCCTTATCCACGAAGAATTGGCTAAGAAGAACGGTCTTTCGCITCATGACAAGAT^ 

CTGGTCAGTCTGAATCTGGAAAAGGACAAACAGTAGAGTTTGAGATTATCGGCATCTTTTCTGGTAAAAAA CAAG 

AGAAATTCACAGGCTTGTCirCrGACTTCAGTGAAAATCAAGTCTTTACAGACTATGAAAGT^ 

GGGCAATAGTGAAGCTCAAGTCAGTGCAGCACGCTTCTATGTAGAAAATCCTAAGGAAATGGACGGACTCATGAA 

GCAGGTAGAAAACTTGGCCTTGGAAAATCAAGGCTACCAAGTCGAAAAGGAAAACAAGGCrT^ 

AGACTCAGTTGCAACTTTCCAAACCTTCCrGACCATCTTCCTTTATGGGATGTTGATAGCAG 

ATTCTGGTTTTGTCTCTCTGGTTGAGAGAACGGGTCTATGAAGTGGGGATTTTACTTGCACTTGGAi^^ 

GCTCGATCTTCCTACAATTCTGTTTAGAGGTAGTTTTGGTATCTCTTGGAGCTTTGC^ 

AACGCAATCACAACTTACCTACTCCAAACTCTACTAGCAAGTGGAGATCAGGCAAGCTTACAAGATACACTAGCC 

AAAGCAAGCAGTTTATCAACTAGCATCTTATCTTTTGCAGAATCCTATGTTTTTCTAGT^ 

TGTAGCCCTTTGTTrCCTATTCTTATTTAGAAAATCACCGAAAGAAATTT^ 

4142.5 

ATGTTACACAACGCATTTGCCTATGTTACAAGGAAGTTTTTCAAATCGATTGTCATCTTCCT 

GGCGAGCTTGAGTTTGGTCGGCTTGTCAATCAAGGGAGCTACTGCCAAGGCTTCTCAGGAGACCI^ 

ACCAATAGCTTCTCCATGCAAATCAATCGTCGCGTCAACCAAGGAACGCCTCGTGGTGCTGGGAATATCAAGGGT 

GAAGACATCAAAAAAATCACCGAAAACAAGGCCATTGAGTCTTATGTCAAACGTATCAACGCTATCGGAGAT^ 

ACTGGATATGACCTGATTGAAACGCCAGAAACCAAGAAGAATCTCACTGCTGATCGTGCCAAGCGTTTTGGAAGT 

AGCTTGATGATTACAGGTGTCAATGACTCCTCTAAAGAAGACAAGTTTGTCTCTGGTTCTTATAAACTAGTCGAAG 

GAGAGCACTTAACCAACGACGACAAGGATAAAATCCTCTTGCACAAGGACTTGGCAGCCAAACACGGCTGGAAA 

GTAGGGGACAAGGTTAAACTGGACTCTAATATCTACGATGCAGATAATGAAAAAGGAGCCAAGGAAACAGTTGA 

AGTGACAATCAAGGGACTCTTTGATGGTCATAATAAGTCAGCAGTAACCTACTCACAAGAACTTTACGAAAACAC 

AGCTATTACAGACATTCACACTGCTGCAAAACTTTATGGATACACAGAAGACACAGCCATTTATGGGGACGCAAC 

CTTCTTTGTAACAGCAGACAAGAACTTGGATGATGTTATGAAAGAGTTGAATGGCATCAGTGGTATCAACTGGAA 

GAGCTACACACTCGTCAAGAGCTCCTCTAACTACCCAGCTCTTGAGCAATCTATCTCTGGTATGTACAAGATGGCC 

AACCTCCTCTTCTGGGGTAGCTTGAGCTTCTCAGTTCTCCTCCTTGCCCTCTTGCTCAGCCTTTGGATC 

TCGCAAGGAAGTGGGAATTCTCCTCTCTATCGGCCTCAAGCAGGCAAGTATCTTGGGTCA 

ATTCATCACCGAATCTATCTTGATTGCTATCCCTGCTCTAGTTTCTGCTTACTTCCTAGCTAATTACACTGCCCGTG 

CAATTGGAAACACTGTCCTTGCCAATGTGACTTCAGGTGTTGCCAAACAGGCTAGTAAGGCGGCTCAAGCCTCTA 

ACCTTGGTGGTGGTGCAGAAGTAGATGGCTTTAGCAAGACCTTGTCGAGCCTAGACATTTCCATTCAGACATCAGA 

CTTTATCATCATTTTTGTCCTTGCCTTGGTTCTAGTGGTTCTCGTTATGGCGCTTGCTTCAAGCA 

AACAACCAAAAGAGCTCTTGCTGGATGGTGAATAA 
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ATGTCACAGGATAAACAAATGAAAGCTGTTTCTCCCCnTCTGCAGCGAGTTATCAATATCTCAT^ 
GGGTTGGGAGTTTGATTTTCTGTATTTGGGCTTATCAGGCTGGGATTTTACAATCC 

ATCCAGCAGGCAGGCATCTGGGGTCCACCTCTCTTTATCTTTTTACAGATTTTACAGACTGTCGTCCCTAT^^ 
5 AGGGGCCTTGACCTCGGTGGCTGGGGTCTTTATCTACGGGCACATCATCGGGACTATCTACAACTATATCGGCATC 
GTGATTGGCTGTGCCATTATCTTTTATCTAGTGCGCCTATACGGAGCTGCCTTTGTCCAGTCTGTCGTC 
GCACCTACGACAAGTACATCGACTGGCTAGATAAGGGCAATCGTTTTGACCGCTTCTTTAT^^ 
GCCCATTAGCCCAGCTGACTTTCTCTGTATGCTGGCTGCCCTGACCAAGATGAGCTTCAAGCGCTACATGA^ 
ATCATTCTGACCAAACCCTTTACCCTCGTGGTTTATACCTACGGTCTGACCTATATTATTGAC^^ 

10 GCTTTGA 
4144.2 

ATGAGAAATATGTGGGTTGTAATCAAGGAAACCrATCTTCGACATGTCGAGTCATGGAGrilUllUill'ATGGTG^ 

TTTCGCCGTTCCrCTTTTTAGGAATCTCTGTAGGAATTGGGCATCTCCAAGGTT 
15 GTGGCAGTAGTGACAACAGTGCCATCTGTAGCAGAAGGACTGAAGAATGTAAATGGTGTTAACTTCGACTATAAA 

GACGAAGCAAGTGCCAAAGAAGCAATTAAAGAAGAAAAATTAAAAGGTTATTTGACCATTGATCAA 
GTTCTAAAGGCAGTTTATCATGGCGAAACATCGCTTGAAAATGGAATTAAATTTGAGG^^ 
AACTGCAAAATCAGCnTAATCGTTCAACTGCTTCCTTGTCrCAAGAGCAGGAAAAACGC^ 
AATTCACAGAAAAGATTGATGAAGCCAAGGAAAATAAAAAGTTTATTCAAACAATTGCAGCAGGTGCC^ 

20 TCTTTCTTTATATGATTCTGATTACCTATGCGGGTGTAACAGCTCAGGAAGTTGC(^GTGAA^ 
TATGGAAGTCGTTTTTTCTAGCATAAGGGCAAGTCACTATTTCTATGCGCGGATC^ 
TAACGCATATTGGGATCTATGTTGTAGGTGGTCTGGCrGCCGTTTTGCTCTTTAA 
TCTGGTATTTTGGATCACTTGGGAGATGCTATCTCACTGAATACCTTGCTCITTAT^ 
CGTAGTCITGGCAGCCnTCCrAGGATCTATGGTTTCTCGTCCTGAGGACTCAGGGAAAGC(^ 

25 ATTTTGATTATGGGTGGTTTTTTTGGAGTGACAGCTCTAGGTGCAGCTGGTGACAATCTCCT 
TTATATTCCCTTTATTTCGACCTTCITTATGCCGTTTCGAACGATTAATG^ 

ATTTCACTTGCTATTACAGTGATTTTTGCGGTGGTAGCAACAGGATTTATCGGACGCATGTATGCTA^^ 
TCAAACGGATGATTTAGGGATTTGGAAAACCTTTAAACGTGCCITATCT^ 

30 4144.3 

ATGACAGAAACCATTAAATTGATGAAGGCTCATACTTCAGTGCGCAGGTTTAAAGAGCAAGAAAT^^ 
GACnTAAATGAGATTTTGACAGCAGCCCAGATGGCATCATCTTGGAAGAATTTCCAATCCTACTCT^ 
TACGAAGTCAAGAGAAGAAAGATGCCTTGTATGAATTGGTACCTCAAGAAGCCATTCGCCAGTCTGCTGTT^ 
TCrCTTTGTCGGAGATTTGAACCGAGCAGAAAAGGGAGCCCGACTTCATACCGACACCTTCCAACCCCAAGG 
35 GGAAGGTCTCTTGATTAGTTCGGTCGATGCAGCTCTTGCTGGACAAAACGCCTTGTTGGCAGCTGA^ 

TATGGTGGTGTGATTATCGGTTTGGTTCGATACAAGTCTGAAGAAGTGGCAGAGCTCTTTAACCTACCTGACTACA 
CCTATTCTGTCTTTGGGATGGCACTGGGTGTGCCAAATCAACATCATGATATGAAACCGAGACTGCCACTAGAGA 
ATGTTGTCrrTGAGGAAGAATACCAAGAACAGTCAACTGAGGCAATCCAAGCTTATGACCGTGTTCAGGCTGACT 
ATGCTGGGGCGCGTGCGACCACAAGCTGGAGTCAGCGCCTAGCAGAACAGTTTGGTCAAGCTGAACCAAGCTCAA 

40 CTAGAAAAAATCTTGAACAGAAGAAATTATTGTAG 
4146.1 

ATGTTAAAACTTArrGCTATTGTTGGAACAAATTCAAAACGTTCTACAAACCGTCAAT^ 
AACACTTTACTGACAAAGCTGAAATTGAACTTGTTGAAATCAAGGCCATTCCT 
45 GCAAGTACCTGCTGAAATATTGGAAATTGCTGCTAAAATCGAAGAGGCAGATGGCGTTATTATCG GTACT^ 
GTATGATCACrCTATTCCAGCTGTTTTGATGAGCGCTCITGCTTGGTTGTCTTATC 
AACCAATCATGATTACAGGTGCTTCTTACGGTACGCnTGGTTCATCTCGTGCCCAATTGCAACnTCGTC 

GAATGCTCCTGAAATCAAGGCAAATGTTCTTCCAGATGAATTCITGCTCTCACACTCTC^ 
AGTGGCGACnTGGTTGACCTTGATGTTATCAAGAAATTGGATGCCATCTTTGATGACTTCCGTATCT^ 
50 TCACAGAAAAATTACGTAATGCACAAGAATTACTTCGCAAAGATGCTGAAGACirrGACTGGGAA^ 

4146.2 

ATGAATACCTATCAATTAAATAATGGAGTAGAAATTCCAGTATTGGGATTTGGAACTTTTAAGGCTAAGGATGG^ 
GAAGAAGCCTATCGTGCAGTGTTAGAAGCCTTGAAGGCTGGTTATCGTCATATTGATACGGCGGCGATTTATCA 
55 ATGAAGAAAGTGTTGGTCAAGCAATCAAAGATAGCGGAGTTCCACGTGAAGAAATGTTCGTAACTACCAAGCTTT 
GGAATAGTCAGCAAACCTATGAGCAAACTCGTCAAGCTTTGGAAAAATCTATAGAAAAACTGGGCTTGGATTA^ 
TGGATTTGTATTTGATTCATTGGCCGAACCCAAAACCGCTCAGAGAAAATGACGCATGGAAAACTCGCAATGCGG 
AAGTTTGGAGAGCGATGGAAGACCTCTATCAAGAAGGGAAAATCCGTGCTATCGGCGTTAGCAATTTTCTTCCCC 

ATCATTTGGATGCCTTGCTTGAAACTGCAACTATCGTTCCTGCGGTCAATCAAGTTCG(^ 
60 TCAAGATCAAGTCGTAGCTTACTGTCGTGAAAAGGGAATTTTATTGGAAG(m'GGGGGCCTTTTGGACAAG^ 

ACTGTTTGATAGCAAGCAAGTCCAAGAAATAGCAGCAAATCACGGAAAATCGGTTGCTCAGATAGCCTTGGCCTG 

GAGCnTGGCAGAAGGATTTTTACCACTTCCAAAATCTGTCACAACCTCTCGTATTCAAGCTAATC^ 

GGAATTGAACTGAGTCATGAGGAGAGAGAAACCTTAAAAACGATTGCTGTTCAATCGGGTGCTCCACGAGTTGAT 

GATGTGGATTTCTAG 

65 
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4147.1 

ATGAGGTGCAAAATGCITGATCCAATTGCTATTCAACTAGGACCCCrAGCCATTCGTTGGTATGCC^ 

TGACAGGCTTGATTCnTGCGGTTTATTTGACCATGAAAGAAGCACCrAGAAAGAAGATCATACCAGACGATAT^^ 

AGATTTTATCTTAGTAGCCTTTCCCTTGGCTATTTTAGGAGCrCGTCTCTACT 

TAGTCAGAATTTAGGAGAGATTTTTGCCATTTGGAATGGTGGTTTGGCCATTTACGGTGGTTTGA^ 

CTTGTGCTCTATATCTTTGCTGACCGTAAACTCATCAATACITGGGATTTT^ 

GATTGCTCAAAGTTTGGGGCGTTGGGGTAATTTCTTTAACCAAGAAGCTTATGGTGCA^ 

TATCTACCTGGCTTTATCCGTGACCAGATGTATATTGAGGGGAGCTACCGTCAACCGACTTTCCIT^ 

TATGGAATCTGCITGGCTTTGCCrrTGATTCTGATTTTTAGACGGAAATGGA^ 

GGCCTTTTACTTGATTTGGTATGGTTTCGGTCGTATGGTTATCGAAGGTATGCGAACAGATAGTCTCATGT^ 

GCTTTCGAGTGTCCCAATGGCTGTCAGTTGTCCTTATCGGTCTCGGTATAATGATCGTTATTTATCAAAATCGAAA 

GAAGGCCCCTTACTATATTACAGAGGAGGAAAACTAA 

4147.2 

ATGGGTAAATTATCCTCAATCCTTTTAGGAACCGTTTCAGGTGCAGCTCTTGCCTTGTT^^ 

GCAAACAAGTTTGCAGTCAGGCTCAAGATrrrCTAGATGATTTGAGAGAAGATCCGGAGTATGCCAAGGAGCA^ 

TCTGTGAAAAACTGACAGAAGTTAAGGAGCAGGCTACAGATTTTGTTCTGAAAACAAAAGAACAGGT^ 

GTGAAATCACTGTGGACAGTATACTTGCTCAAACTAAATCCTATGCTTTTCAAGCGACAGAAGCATC 

AATTAAATAATCTCAAGGAGCAATGGCAAGAAAAAGCCGAAGCTCTTGATGACTCAGAAGAGATTGTGAT^ 

TAACAGAAGAATAA 

4147.3 

ATGAAAACTAAATTGATCTTTTGGGGCTCTATGCTCTTTCTCCTCTCCCTCT 

TTGGATTTTCTATCCTATGGAGATTCAGTGGCTAAACTTAACGAATCGAGTCTATCTAAAAC^ 

TACAATTTTCATATCITGATGAATTATCTGACCAATCCnTTTAGTCAGGTCT^ 

AGCAGCTGGTCTGCACCATTTCGCAGTGGTCAAGAATCTCTTTCATTTGGTTCAGCTAGTAGCT 

CCAAGTTTCTATGTCTTTGTCAATAGGATTGTGAAAAAGGACITTTTGTCTC^^ 

AGTAGTCTTACCTGTGATGATTGGACTTGGGGGAGTTTTGATTGGTTTTGACCAATTC^ 

TTCTCITTGTGGGAGATGATACCTGGCTTTTTGATCCAGCCAAGGATCCTGTTATTATGAT^^ 

TTTCnrCATGCCITCCTCCTCTTTTTTGCCCrCTATGAAA^ 

GTGA 

4149.1 

ATGACTTATCATTTTACTGAAGAATACGATATTATTGTAATTGGTGCGGGACACGCTGGGGTTGAGGCT^ 

CCGCTAGCCGTATGGGCTGTAAGGTCCTGCITGCGACCATCAATATTGAAATGCTGGCTTTCATGCCI^ 

CTCTATCGGTGGTTCTGCCAAGGGGATTGTCGTGCGTGAAGTCGATGCCCTCGGTGGCGAGATGGCCAAAACCATT 

GACAAGACTTACATCCAGATGAAGATGCTAAACACAGGGAAGGGGCCAGCTGTCCGTGCCCTTCGTGCGCAGGCT 

GACAAGGAACTTTACTCTAAGGAGATGCGCAAGACGGTTGAAAACCAAGAAAATCTGACCCTTCGTC/^^ 

ATTGATGAGATTTTGGTGGAAGATGGCAAGGTTGTCGGTGTGCGTACAGCCACCCATCAAGAATATGCTGCTAAG 

GCTGTTATTGTGACGACAGGGACTGCTCTCCGTGGGGAAATTATCATCGGAGACCTCAAGTACTCATCAGGTCCTA 

ACCACAGCTTGGCTTCTATTAACCTAGCTGACAATCTCAAGGAACTGGGTCTCGAAATCGGTCGTT^ 

AACCCCTCCACGTGTCAAGGCITCnTCTATCAATTACGATGTGACAGAAATTCAGCCAGGAGACGAAGTGCCTAAT 

CATTTCTCATACACTTCACGTGATGAGGATTATGTCAAGGACCAAGTACCATGCTGGTTGACCTATACCAATGGTA 

CCAGTCATGAGATTATCCAAAACAACCTCCACCGTGCGCCTATGTTTACAGGTGTGGTCAAGGGAGTGGGGCCTC 

GTTACTGTCCGTCGATTGAAGACAAGATTGTGCGCTTTGCGGACAAGGAACGTCACCAACTCTTCCT^ 

AGGGCGCAATACrGAGGAAGTCTATGTGCAAGGACTTTCAACCAGTCTGCCTGAGGATGTCCAGCGTGACITGGT 

GCATTCCATCAAAGGTTTGGAAAATGCAGAGATGATGCGGACAGGTTATGCTATTGA 

GTATGATATGGTCnTGCCTCATCAGTTGCGTGCGACITTGGAAACCAAGAAAATCTCAGGTCTCn^ 

CAGACAAATGGAACATCAGGTTACGAAGAGGCAGCAGGCCAAGGGATTATCGCGGGTATCAATGCGGCTCTGAA 

AATCCAAGGCAAGCCTGAATTGATTTTGAAGCGCAGTGATGGTTATATCGGGGTGATGATCGACGACTTGGTGAC 

CAAGGGAACCATTGAACCCTACCGTCTCTTGACCAGTCGTGCTGAATACCGTCTCATTCnTCGTCATGACAA^^ 

GATATGCGCTTGACTGAGATGGGACGCGAGATTGGCCTTGTGGACGATGAACGCTGGGCTCGTTTTGAAATCA^ 

AAAAATCAATTTGATAATGAGATGAAGCGCCTAGACAGTATCAAACTCAAGCCAGTCAAGGAAACCAATGCCAAG 

GTTGAGGAGATGGGCITCAAACCCTTGACCGATGCAGTGACAGCCAAGGAATTCCnTCGCCGTCCAGAAGT^ 

ACCAAGATGTGGTGGCCTTCATCGGACCAGCTGCAGAAGACnTGGATGACAAGATTATCGAATTGATTGAAACAG 

AAATCAAGTATGAAGGCTATATTTCCAAAGCCATGGACCAGGTTGCCAAGATGAAACGCATGGAAGAAAAACGCA 

TTCCGGCCAATATCGACTGGGATGACATTGATTCTATCGCAACCGAAGCCCGTCAGAAGTTCAAACTCATCAATCC 

AGAAACCATCGGCCAAGCCAGCCGTATTTCGGGAGTAAACCCAGCAGATATTTCTATTTTGATGGTGTATCTGGi^ 

GGTAAAAATCGTAGTATTTCTAAAACTCTTCAAAAATCAAAATGA 



4149.2 

ATGAAAGTATTAGCTTTTGATACGTCCAGCAAGGCTCTTTCTCTGGCTATTTTAGAGGATAAGCAGGTTC^ 
AGACGACGATTAATATTAAGAAAAATCACAGTATTACTCTTATGCCTGCCATCGATTTTTTGATGGCAAGTTTGGA 
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TTGGACACCCAAGGATTTGGACCGAATCGTGGTAGCTGAAGGGCCGGGTAGCTATACAGGCTTGCGAATTGCGGT 

AGCAACTGCTAAGACCnTAGCTCACACCCTGAACATCGAGTTGGTTGGTATGTCGAGTCTCTTGGCTCTGGTG 

CATCAACAAGAAGGTITGTTTGTCCCCTTGATGGATGCGCGTCGCAATAATGTTTATGCAGGATTTTATGA^ 

CCAAACCTGTCATGGCAGAAGCGCACCTATCTTTTGAAGAGGTGCTAGAAAAAGTCAAGGGTACTAGTCAGGTAA 

CCTTTGTCGGAGAAGTTGGCCCCnTTGTTGAGCAGATTCAAAAACACTTGCCAAGGACTGATTACAAAG^^ 

GCCCAATGCAGCTAATCTAGCrCITTTGGCCTGGGACAAGGAAGCAGACTCCTTGCATGATT^ 

CTCAAACGAGTCGAGGCTGAGGAAAACTGGCTCAAGAACCATACCGAGTCTGGCGAGTCTTACATTAAACGCCTA 

TGA 

4149.3 

ATGATAGAAATCAAGCGAATTCAACAACAGCCTGACCTAGCTCAAGCCATCTACGCTGTTATGGCAGCTGTTTACC 

TAGTCAGTCCTTGGACTCTGGAGCAAATCCAAGCAGATCTGTCCCAAGACCAGACTTGGTATGCATTGGCTTATGA 

TGGGGCAGAAGTGATTGGATTTCTAGCTGTGCAGGAGAATCTTTTTGAAGCAGAAGTCCTGCAAATCGCT 

AGGAGCITATCAGGGTCAGGGGATTGCGTCagCCTTGTTTGCTCAATTGCCGACAGACAAGGAAATT^ 

GTCAGACAGTCAAATCAACGAGCGCAAGCATTTTACAAGAAAGAAAAGATGACAGTTATCGCTGAGCGAAAGGC 

CTACTACCATGACCCAGTCGAGGACGCCATTATCATGAAGAGAGAAATAGATGAAGGATAG 

4152.2 

ATGACAAAACAAGTCTTATTAGTGGATGATGAAGAACACATTCTGAAATTGCTTGACTACCATTTA^ 

GGCITTTCTACTCAATTGGTGACAAATGGACGGAAGGCCTTAGCTTTGGCAGAAACAGi^ 

TGCnTGATATCATGTTACCACAATTAGATGGCATGGAAGTTTGTAAGCGGCTGAGAGCCAAAGGCGTCAA/^ 

CAATTATGATGGTTTCTGCGAAAAGTGATGAATTTGATAAGGTTTTGGCCTTGGAA^ 

GACCAAGCCTTTTAGCCCrAGAGAATTGCTGGCGCGTGTCAAGGCrGTCCTCAGGCGAACTA^ 

AGGAGATGATTCAGATAATATCGCTGACGATTCTTGGCTATTTGGGACCTTGAAAGTATACCCTGAGCGTCATGAA 

GTCTACAAGGCGAATAAGTTACTGAGTTTGACCCCAAAAGAATTTGAAAGCGATAAAAATCCGTT^^ 

TCAAAGTTTCGAAAGTAACCGCCCAATAA 

4154.1 

ATGACTACTTTTAAAGATGGATTTTTATGGGGTGGTGCTGTTGCTGCTCATCAACTTGAA 

GTGGCAAGGGAATTAGTGTTGCTGATGTTATGACTGCTGGTCGTCATGGAGTAGCTCGTGAAATTACTTTGGGAGT 

TTTAGAGGGTAAATATTATCCAAATCATGAGGCGATAGATTTTTATCACCGTTATAAAGAAGATATAGCAC^^ 

GCTGAAATGGGATTCAAGTGCTTCCGTACCTCTATTGCATGGACACGTATCTTTCCAAAAGGTGATGAGTTAGAGC 

CGAATGAAGAAGGATTACAGTTTTATGATAATCTTTTTGATGAATGCTTAAAGAATGGTAT^ 

TCTATCTCATTTTGAAATGCCTTATCACTTAGTGACCGAATATGGTGGTTGGAAAAATAGGAAA^ 

TTTGCrCGTTTTGCAGAAGTCGTATTTAAACGTTACAAAGATAAGGTTAAATATTGGATGACT^ 

ATAATCAAGCGAATTATCAGGAAGATTTTGCACCATTTACTAACTCAGGTATTGTATATGAGGAAGGTGATAATAG 

AGAAGCAATTATGTATCAAGCAGCACATTACGAATTAGTTGCTTCTGCACGAGCTGTAAAAATTGGTCATGAGATT 

AATCCAGATTTTCAAATAGGTTGTATGATTGCGATGTGTCCAATTTATCCAGTTACITGCAATCCTA^ 

AATGGCAATGAAAGCTATGCAGAAGCGTTATTATTTTGCTGATGTGCATGTTTTAGGTAAATATCCTGAGCATAT^ 

TTCAAGTATTGGGAACGAAAAGGTATTTCAGTTGATTTTACTGCCCAGGATAAAGAAGATTTAC^ 

TAGATTACATTGGTTTCAGTTACTATATGTCCTTTGCTATCGACTCTCATCGTGAAAATAATCC^ 

CTTGAAACAGAAGATTTAGTGAAAAATAATTATGTTAAGGCTTCTGAATGGGAGTGGCAAACT 

TTGCGTTATGCGTTAAATTGGTTTACAGACCACTATCACnTACCACTCnTTAT^ 

AGATCAAGTTGCAGCAGATGGTATGGTACATGATGATTATAGAATTGAATATCTAGGTGCCCATATTCGTGAAATG 
AAAAAGGCTGTAGTTGAAGATGGTGTTGATTTAATGGGTTATACTCCATGGGGATGTATTGATTTGGTT^ 
GTACCGGTGAAATGCGGAAACGTTATGGCTTTATTTATGTAGATAAAGATGATAATGGGAAGGGAAGTTATAA^^ 
GTTCCCCGAAAAAATCTTTTGGCTGGTATAAGGAAGTTATTTCATCTAACGGTGAATCAGTAGAATAG 

4154.2 

ATGGATCAACAAAACGGGTTGTTTGGTTTTCTTGAAAACCATGTTATGGGACCAATGGGCAAACTTG 

AAGTAGTACGTGCTATCACGGCTGCAGGTATGGCTGCTGTACCATTTACTATTGTAGGATCAATGTTTTTGGTATT 

CAGTATTTTGCCACAAGCTTTCTCATTTTGGCCAATTGTGGCAGATATTTTCTCTGC^ 

CACTTTACATGGTTGCAAACTATGCGACTATGGGTTCTCTATCTCTTTATTTCGTTCT 

ACAAAAATTTATGCAGAGGAAGAAGAACTCAATATGAATCCTCTTAATGGTGCCTTGCTTGCCT^ 

TCATGACAGTACCGCAAATCATTTTTGATGGTGGAATGATGAAGACTGTGACAAGTCTAAAAGAAGGTGCAGTAA 

TTGCAGATGGATGGGCAATGGGAAATGTAGTCGCACGTTTTGGGACAACAGGGATTTTTACCGCAATCATTATGG 

CAATTGTGACTGTTCTTATTTATCGTATGTGTGTTAAACATAATTGGGTTATTAAAATGCCTGAAGCT^ 

GGAGTTTCTCGTGGATTTACCGCTTTGGTTCCGGGATTTGTTGTTGCATTTGTTGTTAT(^^ 

GTAGCAATGGGAACAGATATTTTTAAAGTCATTGCAATTCCATTTGGTTTTGTATCCAATCTGACT 

TTGGTTTAATGATTATTTATCTATTGACTCAACTACTTTGGATTGTAGGTATCCACGGTGCGAACATTGT^^ 

TTTGTTAGTCCAATTGCTCTTGCTAACATGGCTGAAAATGCTGCTGGCGGGCACTTCGCTGTTGCAGGTG^ 

CTAATATGTTTGTAATTGCAGGTGGTTCTGGTGCAACTTTAGGACTATGTTTATATATTGCTTT^ 

GAACAGCTTAAAGCAATAGGACGAGCATCTGTAGTTCCAGCCTTATTTAATATTAATGAACGATTAATTTT^ 

TACCTATTATCTATAATCCAGCCITGGCrATACCATTTATTTTAGCACCAATGGTTACTGCT 

GCGAATTCTCTAAACTTTATTAAGCCAATTATCGCACAGGTTCCATGGCCAACTCCAGTAGGGATTGGAGCTTTCT 
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TAGGGACAGCAGATCITCGAGCTGTATTAGTTGCTCTAGTATGTGCATTTGCAGCATTCCTAGTCTATCTTC 
ATCCGTGTATATGATCAAAAATTGGTGAAAGAAGAGCAAGGTATCTAA 

4155.1 

5 ATGAAAAAATTrrATGTAAGTCCAATTTTTCCTATTCTAGTAGGATTGATTGCGT^ 

TATTTTrGTTAATAATAATCTGTTGACGGTTTTAATTTTGTTTC^^ 

ACTGAGAGTGCATTATACAAGGAGTGATGTAGAACAGATACAGTATGTAAACCACCAAGCGGAAGAAAGTTTGAC 
AGCTCTATTGGAACAGATGCCTGTAGGTGTTATGAAATTGAATTTATCTTCrGGAGAGGTTGAGTGGT^ 

TATGCTGAATTGATTTTGACCAAGGAAGATGGTGATTTTGATTTAGAAGCT^ 

10 TAGGAAATCCGTCTACTTATGCCAAGCTTGGTGAGAAGCGTTATGCTGTTCATATGGATGCTTC^ 

GTATTTTGTAGATGTATCCAGGGAACAAGCCATAACAGATGAATTGGTAACAAGTAGACCAGTGA TTGGG ATTGT 
CTCTGTGGATAATTATGATGATTTGGAGGATGAAACTTCTGAGTCAGATATTAGTCAAATCAAT^ 
AATTTTATATCAGAGTTTTCAGAAAAACACATGATGTTTTCTCGTCGGGTAAG T^^ 
TGACTACACGGTGCTTGAGGGCTTGATGAATGATAAATTTTCTGTTATTGATGC^ 

15 AGACAGTTGCCCTTGACCTTAAGTATGGGATTTTCTTATGGCGATGGAAATCATGATGAGATAGGGAAAG^ 
TGCTCAATTTGAACITGGCTGAAGTACGTGGTGGCGACCAGGTGGTTGTTAAGGAAAACGACGAAACGAA^ 
CAGTTTATTTTGGTGGTGGGTCTGCTGCTTCAATCAAGCGTACACGGACTCGTACGCGCGCTATGATGAC^ 
TTCAGATAAGATTCGGAGTGTAGATCAGGTTTTTGTAGTCGGTCACAAAAATTTAGACATGGATGCT^ 
GCTGTAGGTATGCAGTTGTTCGCCAGCAATGTGATTGAAAATAGCTATGCTCTTTATGATGAAGAACAAATGTCT 

20 CAGATATTGAACGAGCTGTTTCATTCATAGAAAAAGAAGGAGTTACGAAGTTGTTGTCTGTTAAGGATGCy^^ 
GGATGGTGACCAATCGTTCTTTGTTGATTCTTGTAGACCATTCAAAGACAGCCTTAACATTATCAAAA 
TGATTTATTTACCCAAACCATTGTTATTGACCACCATAGAAGGGATCAGGATTTTCCAGATAATGCGGTTATTACT 
TATATCGAAAGTGGTGCAAGTAGTGCCAGTGAGTTGGTAACGGAATTGATTCAGTTCCAGAATTCTAAGAAAAAT 
CGTTTGAGTCGTATGCAAGCAAGTGTCTTGATGGCTGGTATGATGTTGGATACTAAAAATTTCACCTCGCGAGTAA 

25 CTAGTCGGACATTTGATGTTGCTAGCTATCTCAGAACGCGCGGAAGTGATAGTATTGCTATCCAGGAAATCGCTGC 
GACAGATTTTGAAGAATATCGTGAGGTCAATGAACTTATTTTACAGGGGCGTAAATTAGGTTCAGATGTACTAAT^ 
GCAGAGGCTAAGGACATGAAATGCTATGATACAGTTGTTATTAGTAAGGCAGCAGATGCCATGTTAGCCATGTCA 
GGTATTGAAGCGAGTTTTGTTCTTGCGAAGAATACACAAGGATTTATCTCTATCTCAGCTCGAAGTCGTAGTA^ 
TGAATGTACAACGGATTATGGAAGAGTTAGGCGGTGGAGGCCACTTTAATTTGGCAGCAGCTCAAATTAAAGATG 

30 TAACCTTGTCAGAAGCAGGTGAAAAACTGACAGAAATTGTATTAAATGAAATGAAGGAAAAGGAGA^ 

TGA 

4156.1 

ATGAAAGAGAAAAATATGTGGAAAGAATTGTTGAATCGTGCAGGCTGGATTTTGGTCTT^ 
35 TATATCAGGTTCCCCTAGTGGTTACCTCTATTTTGACTTTAAAAGAAGTAGCCCTGCTACAGTCA GGGCT 
TGCTGGCCrTTCAATTGTGGTTCTGGCTCTATTTATTATGGGAGCTCGTAAAACC^ 
CnTTTTrrAGAGCTAAAGATTTGGCACGTTTGGGCnTGAGTTATCT 

ATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTCTCAGATTAATGATATGGTTCAAAATAG 

TGATTTCCAGrrr crr C lT GCTAGCCrrGCTTGCTCCGATTTGTC 
40 AAGATTTTCCGAGGCAAGGAGAACTTGGGATTTGTAGTCGGTACGATTGTGTTTGCTT^ 

ATTTACCTTCTTTATTGATTTATGGAGGTATGTCGACAGTTCTATCTTGGACAGCCTAC^ 
AATGTCGATCTTGCTTCACATGATTGTTAATGGGATTGCTTTCTGTTTGTTGGCT 

CATTAGGAATTTCTGTTTAAATGAAAGAGAAAAATATGTGGAAAGAATTGTTGAATCGTGCAGGCTGGAT^^ 

CITTTTACrTGCCGTCCTTTTATATCAGGTTCCCCTAGTGGTTACCTCT 
45 TACAGTCAGGGCTGATAGTTGCTGGCCnTTCAATTGTGGTTCrGGCTCTATTTATTATGGGAGCTCGT 

GTTAGCTAGTTTTAATTTTTCITTTTTTA 

GGTCAAATATACTTGGTTCCATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTCTCAGAT^ 
TATGGTTCAAAATAGTTCGTTGATTTCCAGTTT(m'CTTGCTAGCCTTGCTTGCT 

GTCGTGGGATTGTTCCTAAAAAGATTTTCCGAGGCAAGGAGAACTTGGGATTTGTAGTCGGTACGATT 

50 TTTATTGCATCAACCAAGTAATTTACCTTCITTATTGATTTATGGAGGT^ 

ACAAGACCCAACGTTTGGAAATGTCGATCTTGCTTCACATGATTGTTAATGGGATTGCTTTCTGT^ 

GTGGTGATTATGAGTCGGACATTAGGAATTTCTGTTTAA 
4156.4 

55 ATGGATACACAAAAGATTGAAGCGGCTGTAAAAATGATTATCGAGGCTGTAGGAGAGGACGCTAATCGCGAGGGC 
TTGCAGGAAACACCTGCTCGTGTAGCCCGTATGTATCAAGAGATTTTTTCAGGTCTTGGTCAAACAGCAGAGG 
ATTTGTCAAAATCCTTTGAAATTATTGACGATAATATGGTGGTAGAAAAGGATATCrnTTCCATACCATGTGTGA 
ACACCACnrCTTGCCATTTTATGGTAGAGCGCACATTGCCTACATTCCAGATGGTCGTGTGGCAGGCTTGTCTA^ 
CTAGCCCGTACGGTTGAAGTTTATTCGAAAAAACCACAAATTCAAGAACGTTTGAATATCGAAGTGGCCGATGCC 

60 TTGATGGACTATCTAGGTGCTAAAGGAGCCTTTGTTGTCATTGAGGCGGAACATATGTGTATGAGTATGCGTGGTG 
TTAGAAAACCAGGCACTGCAACCTTGACGACAGTAGCTCGTGGTCTATTTGAAACAGATAAGGATCTCCGTGACC 
AAGCTTATCGTTTAATGGGGCTATAA 

4157.2 
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ATGAAAGACTTGTTTTTAAAGAGAAAGCAGGCCTTTCGTAAGGAGTGTCTTGGTTATCT 

ACCACTTTGTCTTGTTCCTGCITGTCCTGTTGGGCTTTCTAGCCTACCAGT^ 

GAAAATCATTGGCCTATCCTTTTGTTTGTAGGAATTACGTCTGTTTTACTTTTAC^^ 

TATGGAGGCTCCAGACAAGCTCTTTCrCTTAGTTGGAGAAGAGGAAATTAAGCTCCATCTCAAGCGTCA^ 

5 CATTTCCCTAGTCTTTTGGCTCITTGTACAGACCCTTT^ 

TGGCTTGCCAGTTTTTCTGCrCTATGTGCTTTTATTGGGGGTAGGAAAATATCT 
AATTTTTCACTGAAACTGGACTGGACTGGGACTATGTTATTTCTCAAGAAAGCAAGCGT 
TTTCTTTGCCCTCTTTACGCAGGTCAAGGGAATTTCAAACAGCGTTAAGCGTCGTGCCTATCT 
AGGCTGTTCAGAAGGTGCCTGGGAAGATTTGGCAAAATCTCTATCTGCGTTCTTATCTGCGAAATGGC^ 
10 TGCTCrCAGTCnTCGTCTTCrcrrGCTTTCCTTGCTGGC 

TGGTAGTTCTCTTTAACTACCrCTTGCTCTTCCAGTTGCTGGCCCTCTATCATGC 
CAACTCTTTCCGCTGGACAAGGGGCAAAAGGAAAAAGGCITACAGGAGGTAGTTCGAGGATTGA 
TTACnTGTGGAATTAGTTGTTGGGTTGATTACCnTCCAAGAAAAACTAGCCCTTCTAG^ 
TGGTTTTACTAGTCITGTATTTGCCriTATCAGGTAAAACGTCAGATGCAGGACTAA 

15 

4158.1 

ATGAGAAAATCAATAGTATTAGCGGCAGATAATGCCTATCnTATTCCTTTAGAGACGACTATAAAGTCTGTATT^^ 

ATCACAATAGAGATGTTGATTTTTATATTCTCAACAGTGATATAGCTCCTGAATGGTTTAAA 

AATGGAAGTTGTGAATTCTACAATTCGCAGTGTACACATTGATAAAGAACTTTTTGA^ 

20 CATATAAATTATGCrTCTTACTTTAGATTTTITGCGAC^ 

CGATATCATTGTAACTGGGGAACTAGCTACnTTGTTTGAGATAGATCTCAAAGGATATTCAATTGGTG^^ 
GATGTCTATGCCTATGAAGGACGAAAATCTGGATTTAATACTGGTATGTTACTAATGGATGTTGCAAAGTGGA^ 
AACATTCTATTGTCAATAGTTTATTGGAATTAGCGGCCGAGCAGAATCAAGTTGTTCATCTTGGGGATCAGAGTAT 
TTTAAATATTTATTTTGAGGATAATTGGCTAGCCTTAGATAAAACATATAATTATATGGTGGGTATTGA^ 

25 ACCTTGCTCAAGAATGTGAACGTCTAGATGACAATCCACCTACAATTGTTCACTATGCTAGTCATGATAAACCT^ 
GAATACATATAGTATATCTAGACTACGTGAATTATGGTGGGTTTATAGAGATTTGGATTGGTCAGAGATTGCTT^ 
CAACGTTCCGATTTAAATTATTTTGAAAGAAGCAATCAGTCTAAAAAACAAGTGATGCTTGTGA 
GATATAAAACATTTAGAGTATTTAGTACAACGGTTACCTGATTGGCATTTTCATTTGGCT 

TGAGGAGCTGACCTCTCTATCACAGTATACGAATGTAACAGTATATCAAAATGTATTACATAGTAGAATTGATTGG 
30 CTATTGGACGATTCTATAGTTTATTTAGATATTAATACAGGTGGAGAGGTTTTTAATGTAGTTACAAGGGC 

AAAGTGGCAAGAAAATCnrCGCTTTTGATATCACACGTAAAAGTATGGATGATGGACTCTATGACGGTATi 1 1 
TGTGGAGAGACCAGATGATTTAGTGGATAGAATGAAGAATATAGAGATAGAGTAA 

4158.2 

35 ATGACTAAGATTTATTCGTCAATAGCAGTAAAAAAAGGACTATTTACCrCATTTCTACTGT^ 
GAAGTCGTATTATTCTCCCTTTTGTTGACCTAAATACTAAAGATTTT^ 
TCAGCCGCCCTAACAGGTGGGAATCTAAGAAGTTTATCAATTTTTTCTGTTGGATTATC^ 
TGATTTTATGGCAGATGTTTTCnTTTTCTAAACGGTTGGGTTTAACA 
AAAATGTACCTGACCTTGCTAATTGCTGTGATTCAATCCITGGCAGTTAGCTTG^^ 

40 ATTCTGCAATATTGGTTGTTCTAATGAATACAATATTGCTGATAGCAGGAACATTT^ 

TTAAATGCGAGTATGGGGATTGGAGGTTCTATTGTAATCCTCCTATCCAGTATGGTTTTAAATATTCCT 

TTTTGGAAACATTTCAGACAGTACACATTCCAACAGGGATTATTGTGTTACnTGCT^ 

TATTTACITGCCCTTATGTATCGAGCTCGCTATTTGGTTCCTGTTAATAAAATTG^ 

CTATTCTTATCTCGAAATCATGTTGAATCCrrGCAGGTGGGATGCCTTATATGTATGTGATGAG'll"l"iC l lAGTGTAC 
45 CAGCTTATTTGTTCATCnTGTTGGGATTTATTTTCCCTAATCATTCAGGGTTAGCG 
GTTGGAAAGCCnTTGTGGGTCTATGTTTATATTTCGGTCTTATTT^ 

AATGGAGAAGAGATTGCAGACCGTATGAAAAAATCTGGAGAATACATTTATGGTATTTATCCAGGTGCGGATACT 
AGTCGATTTATTAATCGATTGGTCCTTCGTTTCTCAGTCATAGGTGGTCTCTTTAATGTGATTATGGCAG 
CATGCTTTTTGTTTTGTTTGATGAAAAGTTATTACGATTGGCAATGA 
50 TGATTTTTACGATTAGAGACGAGGTCAAGGCTTTAAGGCTAAATGAGACCTATAGACCTTTGAT^ 

4158.3 

ATGTCCTCTCTTTCGGATCAAGAATTAGTAGCTAAAACAGTAGAGTTTCGTCAGCGTCTTTCCGAGGGAGAAAGTC 
TAGACGATATTTTGGTTGAAGCTTTTGCTGTGGTGCGTGAAGCAGATAAGCGGATTTTAGGGATGTT^ 

55 TGTTCAAGTCATGGGAGCTATTGTCATGCACTATGGAAATGTTGCTGAGATGAATACGGGGGAAGGTAAGACCTT 
GACAGCTACCATGCCTGTCTATTTGAACGCTTTTTCAGGAGAAGGAGTGATGGTTGTGACTCCrA^ 
TCAAAGCGTGATGCCGAGGAAATGGGTCAAGTTTATCGTTTTCTAGGATTGACCATTGGTGTACCATTTACGGAAG 
ATCCAAAGAAGGAGATGAAAGCrGAAGAAAAGAAGCnTATCrATGCTTCGGATATCATCTACACAACCy^ 
ATTTAGGTTTTGATTATCTAAATGATAACCTAGCCTCGAATGAAGAAGGTAAGTTTTTACGAC^ 

60 GATTATTGATGAAATTGATGATATCTTGCITGATAGTGCACAAACTCCrcrGATTATTGCGGG^ 

AGTCrAATTACTATGCGATCATTGATACACTTGTAACAACCrrGGTCGAAGGAGAGGATTATATCr^ 
GAAAGAGGAGGTTTGGCTCACTACTAAGGGGGCCAAGTCTGCTGAGAATTTCCTAGGGATTGATAATTTATAC^ 
GGAAGAGCATGCGTCTTTTGCTCGTCATTTGGTTTATGCGATTCGAGCTCATAAGCTCTTTACT^ 
TATATCATTCGTGGAAATGAGATGGTACTGGTTGATAAGGGAACAGGGCGTCTAATGGAAATGACTAAACTTCAA 

65 GGAGGTCTCCATCAGGCTATTGAAGCCAAGGAACATGTCAAATTATCTCCTGAGACGCGGGCTATGGCCTCGATC 
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ACCTATCAGAGTCTTTTTAAGATGTTTAATAAGATATCTGGTATGACAGGGACAGGTAAGGTCGCGGAAA^ 
TTTATTGAAACTTACAATATGTCTGTAGTACGCATTCCAACCAATCGTCCGAGACAACGGA 

TTGACTATCCAGATAATCTATATATCACTTTACCTGAAAAAGTGTATGCATCCTTGGAGTACATCAAGCAATACCA 
TGCTAAGGGAAATCCTTTACTCGTTTTTGTAGGCTCAGTTGAAATGTCrCAACT 

AAGGGATTGCCCATAATGTCCTAAATGCTAATAATGCGGCGCGTGAGGCTCAGATTATCTCCGAGTCAGGTCAGA 

TGGGGGCTGTGACAGTGGCTACCTCTATGGCAGGACGTGGTACGGATATCAAGCTTGGTAAAGGAGTCGCAGAGC 

TTGGGGGCTTGATTGTTATTGGGACTGAGCGGATGGAAAGTCAGCGGATCGACCTACAAATTCGTGGCCGTTCrG 

TCGTCAGGGAGATCCTGGTATGAGTAAATTTTTTGTATCCTTAGAGGATGATGTTATCAAGAAATT^ 

TGGGTGCATAAAAAGTACAAAGACTATCAGGTTCAAGATATGACTCAACCGGAAGTATTGAAAGGTCGTAAATAC 

CGGAAACTAGTCGAAAAGGCTCAGCATGCCAGTGATAGTGCTGGACGTTCAGCACGTCGTCAGACTCTGGAGTAT 

GCTGAAAGTATGAATATACAACGGGATATAGTCTATAAAGAGAGAAATCGTCTAATAGATGGTTCTCGTGACTTA 

GAGGATGTTGTTGTGGATATCATTGAGAGATATACAGAAGAGGTAGCGGCTGATCACTATGCTAGTCGTGAATTAT 

TGTTTCACTTTATTGTGACCAATATTAGTTTTCATGTTAAAGAGGTTCCAGATTATATA^^ 

GCAGTTCGTAGCTTTATGAAGCAGGTGATTGATAAAGAACTTTCTGAAAAGAAAGAA 

TATATGAACAGTTTTTACGACnTTCACTGCITAAAGCCATTGATG 

ACAGCTATCCATGGCTATCGGTGGTCAATCTGCTAGTCAGAAAAATCCAATCGTAGAGTACTATCAAGAAGCCTA 
CGCGGGCTTTGAAGCTATGAAAGAACAGATTCATGCGGATATGGTGCGTAATCrCCTGATGGGGCTGGTTGAGGT 
CACTCCAAAAGGTGAAATCGTGACTCATTTTCCATAA 

4158.4 

ATGATAGGGACmTCGCCGCrGCTCrTGTAGCTGTACTAGCAAATTTCATCGTCCCT 

GTGCCAATACTGAAATTGCACCACCAGATGGGATTGGGCAGGTTCTCAGCAACCTCTTGCTCAAACTGGTTG^ 

ACCCAGTCAACGCCCTGCTTACTGCTAACTATATTAGAATCTTATCTTGGGCAGTCATTTT^ 

AGAAGCCAGTAAAAATAGTCAAGAATTGCTAAAAACTATCGCTGACGTGACTTCTAAAATTGTCGAATGGATC^ 

CAATCTGGCTCCATTTGGAATCCTTGGTCITGTTTTTAAAACCATTTCTGAC^ 

TACGGTATTTTATTGGTTCTATTAGTAACGACrATGCTTTTTGTTGCCCCTGTGGTCA^ 

TTTATGAGACGCAATCCTTACCCTCTAGTTTGGAACTGCCTCCGTGTCAGCGGTGTGACAGCCTTT^ 

GTTCTGCGACTAACATTCCTGTCAACATGAAACTCTGCCATGACCTTGGACrCAACCCAGATACCTATTCT 

TATCCCACTCGGTTCTACTATCAATATGGCTGGAGTAGCGATTACCATTAACCITTTGACCCT^ 

ACTCTTGGAATTCCTGTTGACTTTGCCACAGCCTTTGTCCTCAGTGTGGTAGCAGCTATCTCATCCT 

AGGTATTGCCGGAGGTTCCCTCCTTCTTATCCCAGTTGCTTGTAGCCTTTTCGGTATTTCTA^ 

AAATTGTTGGGGTTGGTTTTGTGATTGGTGTCATCCAAGACTCATGTGAAACAGCCCTTAACTCTTCT 

CCTCTTTACCGCCGTTGCCGAATACGCAGCAACCCGTAAAAAATAA 

4158.5 

ATGTCTATTAGCCAACGTACGACCAAGCTCATCITAGCTACCTGTCTTGCCTGCCTGCnTGCTTAT^^ 

TTCGTCAGCAGTTTCGGCTGGAATTATCGCTCTCnTGAGCCTATCTGATACGCGTAGAAGTACm 

CGCAATCGTCTTTTTTCTATGCITCTAGCTCTGGCTATCGGTGTTCTAGCTT^ 

GAGTCrCGGCCTCTATCTGGCCTTCTACGTTCCTTTAGCCTACAAGATGGGCTGGGAAATTGGCATCAC 

ACTGTTTTGGTTAGCCATCTCTTGGTTCAAGAGTCAACCrCrCCAGACCnTCTAGTC^ 

TATTGGTACAGGATTTGCCTTGCTTGTTAATCTCTATATGCCTTCACGAGAAGAGGAAATCCA 

CTGGTGGAAGAAAAGTTAAAAGATATCCTCCAGCGCTTCAAATACTATTTATCCAGAGGAGACGGACGCA^ 

GCACAGCTGGTAGCAGAATTAGACACGCTTTTGAAAGAAGCCCTCAGACTGGTCTATTTGGATCACTCT 

TCTTTCACCAGACAGACTACCATATCCACTACnTTGAGATGAGACAGCGACAAAGTCG TATCC TGAGAAACAT^ 

CCCAACAGATTAACACTTGTCACCTTGCCGCCAGTGAAAGCCTGATCTTAGCGCAACrCTT^ 

TCAACTGAGCCAGACCAATCCTGCTTCTGATTTGCTAGATGAAATTGAACGTTATCTGG 

AGTCTGCCCAAGACAAGAGAAGAATTTGAAACCCGCGCCACCCTTCnTCAACTCCTACGTGAAGCCAAAACC^ 
ATCCAAGTAAAAGTTGATTTTTACCAAAAATATAGACAGTAA 

4158.6 

ATGGAAATCATGTCGCTTGCGATTGCTGTTTTTGCCGTCATCATTGGTTTAGTCATTGGATATGTCAGCAT 

TAAGATGAAATCATCTCAGGAAGCTGCAGAGTTGATGCTTTTAAATGCTGAACAAGAAGCAACTAATTTACGTGG 

ACAAGCTGAGCGTGAAGCGGATTTACTTGTTAATGAAGCCAAACGTGAAAGCAAGTCTCTTAAAAAAGAAGCACT 

ATTGGAGGCCAAAGAAGAAGCCAGAAAATACCGTGAAGAAGTGGACGCTGAATTCAAATCAGAACGTCAAGAAC 

TCAAACAAATCGAAAGTCGTTTGACAGAGAGAGCrACTAGCCTTGACCGTAAGGACGACAATTTGACGAGTA/^ 

AACAAACACTTGAACAAAAAGAACAAAGTATTTCTGATAGAGCGAAAAACCTTGATGCGCGTGAAGAGCA^ 

AGGAAGTCGAAAGACAAAAAGAAGCAGAACTAGAGCGTATTGGTGCGCTGTCTCAGGCAGAAGCACGAGATATT 

ATCTTGGCTCAGACAGAGGAAAACITGACCAGGGAGATTGCCAGTCGCATTCGCGAAGCTGAGCAAGAGGTCAAG 

GAACGTTCTGACAAAATGGCCAAGGACATCTTGGTTCAAGCTATGCAACGTATCGCTGGTGAATATGTAGCGGAG 

TCAACAAACTCAACAGTTCATCTGCCAGACGATACTATGAAGGGACGCATTATTGGTCGTGAAGGTCGTAACATT 

CGTACCTTTGAAAGTTTGACAGGGGTCGATGTGATTATCGACGATACACCAGAAGTGGTGACCTTGTCAGGATTTG 

ATCCGATTCGTCGTGAGATTGCCCGTATGACTATGGAAATGTTGCTCAAAGATGGTCGTATACATCCAGCTCGTAT 

CGAAGAGTTGGTTGAGAAAAACCGTCAAGAGATTGACAATAAGATTCGTGAATACGGTGAGGCTGCTGCCTATGA 

AATTGGTGCGCCAAAC(m'CATCCAGACTTGATGAAGATTATGGGACGTTTGCAGTTCCGTACTTCATATGGACAA 

AATGTTTTGCGCCATTCGATTGAGGTTGCTAAGTTGGCTGGTATCATGGCGAGCGAACTTGGTGAAAATGCGGCTC 
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TTGCCCGTCGTGCTGGATTCCTTCACGATATCGGGAAAGCCATTGACCATGAGGTTGAAGGTAGCCACGTTGAAAT 
CGGTATGGAATTGGCCCGTAAGTACAAGGAACCCCCAGTTGTGGTGAATACGATTGCTAGTCACCACGGAGATGT 
TGAAGCTGAGAGCGTGATAGCAGTTATCGTCGCTGCAGCAGATGCCTTGAGCGCAGCCCGTCCAGGTGCTCGTAG 
TGAGTCTCnTGAAAGCTACATCAAGCGTCTCCATGATTTGGAAGAAATTGCTAACGGCTTTGAAGGAGTGC^^ 
5 AGCTTTGCCCTTCAAGCAGGACGTGAAATTCGTATCATGGTCAATCCAGGAAAAATCAAGGACGACAAAGTCACA 
ATCnrGGCTCACAAAGTTCGTAAGAAAATTGAAAACAATCTCGATTATCCAGGAAATATCAAGGTAACCGTGAT^ 
CGCGAGCTTCGTGCAGTAGATTATGCTAAATAA 

4158.7 

10 ATGATGTTAAAACCCTCTATTGATACCTTGCTCGACAAGGTTCCTTCAAAATATTCACTCGTAATC^ 

AACGTGCCCACGAATTGGAAGCAGGTGCCCCAGCAACTCAAGGTTTCAAGTCTGAAAAATCAACTCTTCGCGC^ 

TAGAAGAAATCGAATCAGGAAACGTTACAATTCACCCAGATCCAGAAGGAAAACGTGAAGCAGTGCGTCGCCGTA 

TCGAAGAAGAAAAACGCCGCAAAGAAGAAGAAGAAAAGAAAATCAAAGAGCAAATTGCTAAAGAAi^^ 

TGGTGAAAAAATTTAA 

15 

4161.1 

ATGTCAGCATATCAATTACCGACCGTATGGCAGGATGAAGCTAGTAATCAAGGAGCT TTTAC GGGGCTAAACAGA 
CCAACAGCAGGTGCCCGTTTCGAACAAAACTTGCCAAAAGGAGAACAAGCTTTTCAGC^ 
CCAAATGGTGTGAAGGTTACTATCTTATTGGAAGAATTACTAGAAGCTGGTTTTAAGGA^ 
20 ATAAGATTGCTATCATGGATGGGGATCAATTCGGATCAGACTTTGTGAAGCTCAATCCAAATTCCAAGATTCCAGC 
CTTATTGGACCAGTCAGGTACTGAAAACGTAAGAGTCTTTGAGTCTGCTCATATTCri^ 
TTTGGAGCCTTTTTACCAAGTAATCCTGTGGAAAAGGTAGAAGTTTTGAATTGGCT 
CACCTTTTCTAGGTGGGGGATTTGGACATTTCTTCAATTATGCTCCTGAAAAAT^ 

ACGATGGAAGTGAAACGCCAGTTGGATTTATTGGATAAGGAATTGGCTCAGAAACCTTATATTGCAGGCAATGAC 
25 TATACGATTGCAGATATTGCTATCTGGTCTTGGTATGGACAGTTAGTTCAAGGAAATCITTACCAAGG^ 

AATTCTTGGATGCCTCAAGTTATCAAAATCTAGTAAAATGGGCAGAAAAAATTGCCAATCGTCCAGCTGT^ 
GTGGCTTGGAAGTAACTTATACAGAAATTAAATAG 

4161.2 

30 TTGGCAAGCTTGATCACTTCTATCATCATGTTCTATGTCGGTTTCGATGTTCTAAGAGATACCATTCAAAAGAT^ 
CAGTCGGGAAGAAACGGTCATTGATCCTCTTGGTGCAACTCTAGGAATCATTTCTGCAGCGATTATGT^ 
TATCTCTACAATACTCGCCTCAGTAAGAAATCCAACTCCAATGCGCTGAAGGCAGCTGCTAAGGACAATCTTTCTG 
ACGCTGTTACCTCACTTGGAACCGCCATTGCCATCCTAGCTAGTAGTTTCAATTATCCGATTGTGGATAAACTGGT 

TGCTATCATCATCACTTTCTTTATCTTGAAGACTGCCTATGATATCTTCATCGAGTCT^ 
35 GCTTTGACGACCGCCTGCTCGAGGACTACCAAAAGGCTATCATGGAAATTCCCAAAATCAGCAAGGTCAAATCGC 
AAAGAGGTCGCACCTACGGTAGCAACATCTACCTGGATATTACACTAGAGATGAATCCTGACTTGTCrGTT^ 
AAGCCATGAAATCGCGGATCAGGTCGAGTCTATGCTGGAGGAGCGTTTTGGCGTCTTTGATACCGATGTCCATATC 
GAACCAGCACCTATCCCTGAGGATGAAATTTTAGACAATGTCTATAAAAAATTGCITATGCGTO 
ACCAAGGAAACCAACTAGAAGAACTCTTGACTGATGATTTTGTCTATATTCGCCAAGATGG^^ 
40 AAGAGGCnTATAAGACCAAAAAAGAGTTAAATTCTGCTATCAAGGACATTCAAATTACn^ 

CCAAACTCATCTGCTATGAGTTAGATGGTATCATCCATACCAGTATCTGGCGTCGCCACGAAACCTGGCAAAATAT 

CTTTCATCAAGAAACCAAAAAAGAATAG 

4162.1 

45 ATGACAATTAAACTAGTAGCAACGGATATGGACGGAACCTTCCTAGATGGGAATGGACGCTTTGATATGGATCGT 

CTCAAGTCrCTCTTGGTTTCCTACAAGGAAAAAGGGATTTACnTTGCGGTAGC^ 
TAGAAAAATTATTTGCTGGTGTTCGTGATGACATTATTTTCATCGCGGAAAATGGCAGT^ 
TCAGGACnTGTATGAAGCGACTATGTCTCGTGACTTTTATCTGGCAACTTTTGAAAAGCTG^ 
GTAGATATCAATAAACTGCTCTTGACGGGTAAGAAGGGTTCATATGTTCTAGATACGGTTGATGAG ACCTA TTTGA 

50 AAGTGAGTCAGCACrATAATGAAAATATCCAAAAAGTAGCGAGTTTGGAAGATATCACAGATGACATTTTCAAAT 
TTACAACCAACITCACAGAAGAAACGCTGGAAGATGGGGAGGCnTGGGTAAACGAAAACGTTCCrGGTGTTAAGG 
CCATGACAACTGGCTTTGAATCCATTGATATTGTTCTGGACTATGTCGATAAGGGAGTGGCCATTGTTGAATTAGT 
TAAAAAACTTGGTATCACAATGGATCAGGTCATGGCTTTTGGAGACAATCTTAATGACTTACATATGATGCAGG^ 
GTGGGACATCCTGTAGCTCCTGAAAATGCACGACCTGAAATTTTAGAATTAGCAAAGACTGTGATTGGTCACC^^ 

5 5 AGG AACGGTCGGTT AT AGCTT AT ATGG AGGGCTT AT AA 

4162.2 

ATGGCAGATATAAAATTGArrGCATTGGACTTGGACGGGACCTTGCTGACTACTGATAAAAGGCTGACGGATCGT 
ACCAAGGAAACCTTGCAAGCTGCGCGTGATCGTGGTATCAAGGTCGTATTGACAACTGGTCGTCCCTTAAAAGCC 

60 ATGGATTTCTTTCTCCATGAGTTAGGGACTGACGGTCAGGAAGATGAGTATACCATTACTTTTAATGGTGGATTAG 
TTCAGAAAAATACAGGAGAAATCCTTGATAAAACAGTCTTTTCATATGATGATGTGGCACGTTTGTATGAAGAAAC 
AGAGAAATTATCACTGCCrCTTGATGCCATCTCAGAAGGAACAGTTTATCAAATCCAATCGGACCAAGAAAGTCT 
rrATGCCAAATTCAATCCAGCrTTGACCTTTGTTCCAGTGGACTTTGAAGACrrATCTAGTCAAATGACCTAC/^ 
AAATGCGTGACTGCCTTTGCTCAAGAACCCITGGATGCAGCCATTCAGAAGATTTCTCCAGAATTGTTTGACCAAT 

65 ATGAAATCTTTAAATCACGTGAAATGTTGCTAGAATGGTCACCAAAGAATGTTCATAAAGCAACAGGTTTGGCAA 
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AACTAATCAGCCATCITGGAATCGACCAAAGTCAAGTGATGGCTTGTGGTGACGAGGCCAATGACCTCTCTATGA 
TTGAATGGGCAGGTCTTGGTGTTGCTATGCAAAACGCTGTTCCTGAAGTAAAGGCAGCCGCAAATGTAGTGACGC 
CGATGACCAACGATGAGGAAGCTGTCGCCTGGGCTATCGAAGAATATGTGCTAAAGGAGAACTAA 

5 4164.2 

ATGGAAAGTTTACTTATTCTATTATTAATTGCCAATCTAGCTGGTCTCTTTCTGAT^ 

AGGAGAAACACTTAAGTAAGAGCrrGGAGGATCAGGCAGATCATTTGTCAGACCAGTTGGATTACCGCTT^ 
AAGCCAGACAAGCCAGCCAGTTAGACCAAAAAGATTTGGAAGTGGTTGTCAGCGACCGTTTGCAAGAAGTGCGGA 
TTGAATTGCACCAAGGTCTGACCCAAGTCCGTCAAGAAATGACAGATAATCTCCTCCAAACTAGAGACAAGACAG 
10 ACCAACGTCTCCAAGCCTTGCAGGAATCAAATGAGCAACGTTTGGAACAAATGCGCCAGACGGTCGAGGAAAAAC 

TAGAAAAGACCTTGCAGACACGCTTACAGGCTTCCITTGAGACAGTTTCTA/^ 
CCTTGGAGAAATGCAGACAGTTGCCCGTGATGTCGGAGCrCTTAACAAGGTTCTCTCTG 

GATTCrGGGAGAATTGCAACTGGGGCAAATTATTGAAGACATCATGACACCTGCCCAGTACGAACGAGAATACGC 
AACGGTTGAAAACTCTAGTGAACGAGTGGAGTATGCCATCAAGTTACCCGGACAAGGCGACCAAGAATACGTCTA 

15 TCTGCCAATTGACTCTAAGTTTCCACTGGCAGATTATTACCGCTTGGAAGAAGCCTATGAGACAG 

GAGATTGAACGCTGTCGTAAGTCACTCCTAGCAAGCGTCAAGCGCTTTGCTAGGGATATTAGGAACAAGTACATA 
GCACCACCTCGGACGACCAATTTTGGAGTTTTGTTTGTTCCGACAGAAGGTCTCTACTCA 
CGGTCTTCTTTGATGATTTGAGACGGGAAGAACAGATTATTGTTGCAGGACCAAGTACCCTATC^ 
CTCCCTATCAGTTGGTTTCAAGACCCTTAATATCCAAAAGAGTGCCGACCATATCAGCAAGACTC^ 

20 AAGACCGAGTTTGGCAAGTTTGGTGGTATTCTGGTCAAGGCACAAAAACATCTCCAACATGCCTCTGGCAATAT^ 
ATGAATTATTAAACCGTCGTACCATAGCTATCGAGCGGACGCTCCGTCACATTGAGTTGTCAGAAGGTGAGCCTGC 
GCnTGATCTACrCCATTTTCAAGAAAATGAGGAAGAATATGAAGATTAG 

4164.3 

25 ATGAAGATTAGTCACATGAAAAAAGATGAGTTATTTGAAGGCTTTTACCTAATCA^ 

ACTCGAGCTGGGAAAAACTACCTAGCCTTTACCTTCCAAGATGATAGTGGCGAGATTGATGGGAAGCT 
GCCCAACCTCATAACATTGAGGCCITTACCGCAGGTAAGGTTGTCCACATGAAAGGACGCCGAGAAGTTTAT 
AATACCCCTCAAGTCAATCAAATTACrCTCCGCCTGCCTCAAGCTGGTGAACCCAATGACCCAGCTGATT^ 
TCAAGTCACCAGTTGATGTCAAGGAAATTCGTGACTACATGTCGCAAATGATTTTCAAAATTGAAAATCCTGT 

30 GCAACGGATTGTCCGAAATCTCTACACCAAGTATGATAAGGAATTCTACTCCTATCCAGCTGCCAAGACCAACCA 
CCATGCCTTTGAAACGGGCTTGGCCTATCATACGGCGACCATGGTGCGTTTGGCAGACGCTATTAGCGAAGTT^ 
CCTCAGCTCAATAAGAGCCTGCTCTATGCGGGGATTATGTTGCATGACITAGCTAAGGTCATCGAGTTGACGGGGC 
CAGACCAGACAGAGTACACAGTGCGAGGTAATCTTCTTGGACATATCGCTCTCATTGATAGCGAAATTACCAAGA 
CAGTTATGGAACrCGGCATCGATGATACCAAGGAAGAAGTCGTTTTGCTTCGTCATGTCATCCTCAGTCACCACGG 

35 CITGCTTGAGTATGGAAGCCCAGTCCGTCCACGCATTATGGAAGCAGAGATTATCCATATGATTGACAATCTGGAT 
GCAAGCATGATGATGATGTCAACAGCTCTTGCTTTGGTGGATAAAGGAGAGATGACCAATAAAATCTTCGCTATG 

GATAATCGTTCCTTCTATAAACCAGATTTAGATTAA 
4166.2 

40 ATGAGTGAAAAAGCTAAAAAAGGGTTTAAGATGCCTTCATCTTACACCGTATTATTGATAATCAT^^ 
CAGTGCTAACITGGTTTATCCCTGCGGGGGCCTTTATAGAAGGTATTTACGAGACTCAGCCT 

GATTTGGGATGTCCTCATGGCACCGATTCGGGCTATGCTAGGTACTCATCCAGAGGAAGGTTCGCTCATTAAAG>^ 

ACGAGCGCAGCGATTGATGTAGCCnTCTTCATCCTTATGGTTGGTGGTTTCCTTGGCATTGTC 
CTCTTGACGTAGGGATTGCCTCTATCGTGAAGAAGTATAAGGGCCGCGAAAAAATGTTAATTTTGGTACTGAT^ 

45 TTTGTTTGCCCTCGGTGGTACAACTTATGGTATGGGTGAAGAAACAATGGCCnTCTA 
ATGATGGCCGTTGGTTTTGATAGCCTGACrGGTGTTGCAATTATTTTGCTCGGTTCT^ 

TACTCTGAATCCATTTGCGACAGGTATTGCTTCAGCGACTGCGGGAGTTGGTACAGGGGACGGTATCGTAC^ 
CTGATCnrCTGGGTTACCITGACTGCTCnTAGTACnTGGTTTGTTTAC^ 

GACTAAGTCACTGGTTTATAGTACTCGCAAAGAAGATTTGAAACACTTTAACGTAGAAGAATC^ 
50 TCTACACTTAGCAGCAAACAAAAATCAGTTCTCITCTTATTTGTGTTGACAT^ 

TCCATGGACAGACCTTGGCGTTACCATTTTTGATGACnTTAATACITGGTTGACT 

TTGTCGGTTCATCTACITCTGCACTAGGTACTTGGTACnTCCCAGAAGGCGCAATGCTCT^ 

CTGATTGGTGTTATTTATGGTCrrAAAGAAGATAAGATTATCTCTTCC^ 

TGTTGCCTTGATCGTAGCGATTGCTCGTGGTATTCAAGTTATCATGAACGACGGTATGATTACCGATACAATCCTC 
55 AACTGGGGTAAAGAAGGCTTGAGCGGTCTATCTTCACAAGTCrrTATCGTTGTAACTT^^ 
GTCATTCTTGATCCCATCTTCATCTGGTCTTGCCAGCGCAACTATGGGTATCATGGC 
TCCACnTGGAGAATTTGTAAATGTCCGTCCTAGCTTGATTATCACTGCTTACCAATCTGCT^ 

TGATTGCACCAACATCTGGTATTGTGATGGGAGCTCITGCACTTGGACGTATCAACATTGGTACTTGGTGGAAAT^ 
CATGGGCAAACTCGTAGTCGCTATTATTGTAGTGACCATCGCCCrrCTTCTCCTTGGAACCTTCC^ 
60 AA 

4166.3 

ATGAAAATAGATATAACAAATCAAGTTAAAGATGAATTTCrTATATCATTAAAAACCTTGATT^ 
TACTCAATGAAGGAGAAAATGGAACACCTTTTGGACAAGCAATCCAAGATGTCCTAGAAAAAACITTAG 
65 GTCGAGACATAGGTTTCACTACCTATCTTGACCCTAAAGGTTATTACGGATATGCAGAAATCGGTCAGGGAGCAG 
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AGCTTCTGGCCATTCTCTGTCATTTGGATGTTGTTCCATCAGGTGATGAAGCAGATTGGCAGACACCGCCATTTGA 

AGCAACTATCAAAGACGGCTGGGTATTCGGACGTGGTGTCCAAGATGATAAAGGCCCTTCGCTCGCAGCTCTCTA 

TGCAGTAAAAAGCTTGCTGGACCAAGGTATTCAGTTCAAAAAGCGCGTACGCTTTATCTTTGGTACCGATGAGG^ 

AACCCTCTGGCGCTGCATGGCACGCTACAATACCATCGAAGAACAGGCCAGTATGGGCTTTGCACCTGACTCATC 

TTTTCCTCTGACCTATGCTGAAAAAGGGCTTCTACAGGTCAAACTTCATGGCCCTGGATCGGA 

GAAGTAGGAGGCGCCTTTAACGTTGTACCAGACAAGGCCAACTACCAAGGTCTCCTCrrATGAACAGGT^ 

GGTCTCAAAGAAGCTGGTTATGATTACCAAACCACTGAACAAACCGTAACGGTTCTCGGAGTGCCAAAGCATGCT 

AAGGATGCTAGTCAAGGTATCAATGCTGTCATCCGACTAGCTACCATTCITGCrCCTCTCCAAGAACACCCT 

TCAGTTTTCTTGCAACACAAGCAGGTCAAGACGGCACAGGAAGACAAATCTTTGGTGATATAGCAGATGAACC^ 

CTGGTCACCTATCCTTTAATGTCGCAGGTCTCATGATCAATCATGAACGTTCTGAAATCCGTATTGACATTCGGAC 

TCCTGTCTTAGCTGACAAGGAAGAACTAGTAGAGTTGCTTACAAGATGTGCACAAAACTACCAACTC^ 

AGAGTTTGACTATCTAGCGCCTCTATACGTCGCAGAAGACAGTAAACTCGTTAGCACACTGATGCAAATCTACC 

AGAAAAGACTGGCGATAACAGTCCTGCTATTTCATCCGGTGGTGCCACTTTTGCTCGCACCATGC 

GCCTTCGGCGCCTTATTCCCAGGAGCGAAGCAGACAGAACATCAGGCAAATGAATGTGCCGTTCTAGAAGAT^ 

TACCGTGCrATGGATATTTATGCCGAAGCCGTCTATCGACTTGCAACTTAA 

4169.1 

ATGTCTAATTCATTTGTCAAGTTGTTAGTCTCTCAATTATTTGCAAATTTAGCAGATATT^ 

CATTGCTAACATATACATTATTTCAAAATCAGTAATTGCCACATCACTAGTTCCTATCTTAATA 

TTGTTGCGAGTCTTTTAGTTCCGTTGGTTACTAAAAGGTTAGCGCTAAATAGGGTTTTA^ 

AAGACTATATTATTGGCGATACTGGTAGGAATGTTTACCGTAATGCAATCCGTAGCGCCTTTGGTGACCTATCT 

TTGTTGTTGCAATTTCCATACTAGATGGTTTTGCAGCACCCGTTTCCTATGCTATTGTGCCACGCT 

TTGGGTAAGGCTAATTCAGCCITATCAATGACTGGTGAAGCrGTTCAATTGATAGGTTGGGGAT^ 

TGTTTGCAACAATTGGTCTGTTACCTACCACGTGTATCAATTTAGTCTTGTATATCATTTCTAGCT^ 

TTTCITCCTAACGCTGAAGTGGAGGTGTTAGAGTCAGAAACTAATCTTGAAATTTTG 

TTGCTAGAAATCCTAGATTAAGACTTTTTGTATCAGCAAATTTATTGGAAATT^^ 

TCCATTATACTTGTTTTTGTAACGGAGTTATTAAATAAAACGGAAAGTTACTGGGGATATTCT 

CTATTGGTATTATAATTAGTGGCTTAATTGCTTTTAGGCTATCTGAAAAGTTCCT^ 

ATTATTCACCCCAAATCTAAAAACCATCCAGAATCCTTGCCTTAGCTTAGATCCTGGATGGTTTCTli^ 

ATGGGTGTTTTTTACTAGACAAAAAAGAGTTTCCCCTTTATGGTATAAGTGTAGAAAA^ 

AAACrCACATGAACAGTTTACCAAATCATCACITCCAAAACAAGTCTTTTTACCAACTAT 

TTTAACCCAGTATGGTGGTCITATCrnilTCAGGAACri'lll"iCCCAGTTGAAACTAAAAGAGC^ 

ATTTAGTAACGAATGACCAACGCCGCTACTGTCGTTATTCGGATTCAGATATCCTTGTCCAGTTCCTCTT^ 

TTAACAGGTTATGGAACGGACTATGCTTGTAAAGAATTGTCAGCTGATGCCTACTT 

TCCAAAATTGTTGGAAGGAGGGCAGCTTGCTTCACAGCCAACCTTATCCCGTTTTC^ 

ACAGTCCATAGTTTGCGATGCCTCAACCTTGAATTGGTCGAATTCTTTTTACAGTTTCACC^ 

TTGTAGATATCGATTCTACCCATTTCACAACTTATGGCAAGCAAGAAGGTGTTGCTTATAACGCCCACTATC 

TCATGGCTATCATCCTCTTTATGcmTCGAGGGGAAGACAGGTTATTGTTTCAATGCCCAGCTT C^ 

GTTATTGTTCTGAAGAGGCAGACAGCTTTATCACACCTGTTTTAGAACGGTTTAATC^ 

TAGTGGCrTTGCGACCCCAAAATTATACGATTTAATTGAAAAAACAGGGCAATACT 

AAATACTGTTCTGAGCCGTCTTGGAGACCITTCCCTCCCTTGCCCACAGGATGAGGACr^ 

TCCGCCTACTCAGAAACTCTCTATCAAGCAGGATCTTGGTCGCACAAGCGTCGTGTCTGCCAGTTCTCTGAAC^ 

AAGAAGGAAACITGTTCTACGATGTTATTTCTCTCGTTACAAATATGACGAGTGG AACAAGC CAA^ 

GCTTTATCGTGGACGTGGTCAAGCCGAGAATTTCATCAAGGAGATGAAGGAGGGATTTTTTGGCC^^ 

TAGTTCAACCTTAATCAAAAACGAAGTTCGTATGATGATGAGCTGTATCGCCTACAATCTCTATCTTT^ 

CATCTAGCTGGAGGTGACnTCCAAACnTTAACAATCAAACGCTTCCGCCATCITT^ 

GTGTTCGAACAGGACGCAAGCAGCTCCTCAAATTGTCTAGTCTCTATGCCTATTCCGAAT^ 

TTCTAGGATTAGAAAAGTCAACCTGAATCTTCCTGTTCCnTATGAACCACCTAGAAGAA^ 

CATTAA 

4169.3 

ATGATGGAGTTTTTTCAACAGCTTCCTCATTTAGAGCCATATGGCAATCCTCAGTATTTTGTTT 
AACCITGCCCATCTTTATAGGTCTCTTTTTCAAGAAACGCTTTGCCTGGTA 

TTGTCACCATGTTGGTGGGTGGAAAGACCAATCAACTAGCTGCCTTGGGTATTTACCTTTGCTGGGAAATATTGCT 

CCTGCTTTTCTACAAGCATTATCGAAAAAGCAAGGATGGCAAGTGGGTCITCTACr^ 

CTTCCGATTATCITTGTCAAGGTGCAACCAGCrATCAATGGAACGCAGTCTTTGC^ 

CCTGACCTTTCGTTCGGTTGGAATTGTCATCGAGCTGAGAGATGGAGTGATTAAGGATTTTACCCTCTGGGA^ 
CTCCGTTTCCTTCTCITCATGCCAACTTTCTCGAGTGGTCCAATCGATCGCTTTAAGCGATT^ 

GGCTATTCCTGAGCGAGATGAGTTGATGGATATGCTGGATGAATCTGTCCGCTATATCATGTGGGGCTTTTTGTAT 

AAGTTTATCCTAGCTCATGTTTTAGGAGAGACCITACTACCTCCrCTGAAGA^ 

TCnTTAATCTCTATGCCITGGCAGTTATGTATACITTTGGTCTGGAACT 

TTGCITTGGCCATCTCAAACTTGATGGGAATCCGTAGCCCTATCAACTTTAACAAGCCCT^^ 
AAAGGAGTTTTGGAATCGCTGGCATATGAGTCTGTCCTTCTGGTTCCGTGACTTTGTCnTTAT^ 
GTGTTAACCAGAAAGAAAGTCITTAAAAATCGTAATGTAACCTCAAGCATGGCCTACATTGTAAATATGCT 
TGGGATTTTGGCATGGTGTGACCTGGTACTATATCGCCTATGGACTCTTTCATGGACTAGGCTTGGTCATCAATGA 
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TGCCTGGGTTCGCAAGAAAAAAACGCTCAATAAGGAACGGAAAAAAGCAGGGAAGGCTGCCCTACCTGAGAATC 

GCTGGATTCAGTTGCTTGGCATGGTTGTCACTTTCCATGTTGTCATGTTGTCATTCTTAATC^^ 

ATAATCTATGGTTTAAAAAATAA 

4169.4 

ATGCTTAAACGCTTATGGATGATCTTCGGACCGGTCTTGATCGCTGGTTTGTTGGTTTTTCT 

CCTACTGAGATGCATCATAATCTAGGAGCTGAAAAGCGTTCAGCAGTGGCTACTACTATCGATAGTTTTAAGGAGC 

GAAGTCAAAAAGTCAGAGCACTATCTGATCCAAATGTGCGTTTTGTTCCCTTCTTTGGCTCTAGTGAATGG^^ 

TTTTGACGGTGCTCATCCTGCGGTATTAGCTGAGAAATACAATCGTTCCTACCGTCCTTATCTTTTAGGACA 

GGAGCTGCATCGCTTAACCAATATTTTGGAATGCAACAGATGTTACCACAGCTGGAGAATAAACAAGTTGTGTAT 

GTTATCTCACCTCAGTGGTTCAGTAAAAATGGCTATGATCCAGCAGCCTTCCAGCAGTATTTTAATGGAGACC^ 

TGACTAGTTTTCTGAAACATCAATCTGGGGATCAGGCTAGTCAATATGCAGCGACTCGCTTACT 

AAACGTAGCTATGAAGGACCrGGTTCAGAAGTTGGCAAGTAAAGAAGAATTGTCGACAGCAGACAATGAAATGAT 

TGAATTATTGGCTCGTTTTAATGAACGCCAAGCTTCCTTTTTTGGTCAGTT^ 

ATAAGCATGTAGCTAAGTATTTAAAAATCITGCCAGACCAGTTTTCTTATCAGGCAATAGAAGATGCT 
AGATGCTGAAAAAAATACirCCAATAATGAGATGGGAATGGAAAATTATTTCTATAATGAGCAGAT 
TTTGAAGAAATTAAAGGATTCTCAGAAAAGCTTTACCTATCTCAAGTCGCCAGAGTATAATGAC^ 
TTAACACAGTTTTCTAAATCTAAGGTAAACCCGATTTTTATCATTCCACCTGT^ 

CTGGTCTACGAGAGGATATGTACCAACAAACGGTGCAGAAGATTCGCTACCAGTTAGAAAGTCAAGGTTTTACCA 
ATATAGCAGATTTTTCTAAGGACGGCGGGGAGCCnTrCITTATGAAGGACACCATT 
GTTGGCnTTTGACAAGGCAGTTGATCCTTTCCTATCCAATCCCACACCAGCTCCGACTTACCATCT 
TTTTTCAGCAAAGATTGGGCGACITATGATGGAGATGTCAAAGAATTTCAATA^ 

4169.6 

ATGGAGAAAAACCTCAAGGCTTTGAAACAAACAACAGACCAAGAAGGCCCAGCAATTGAACCTGAAAAGGCAGA 
GGATACCAAGACAGTCCAAAATGGTTACTTCGAGGATGCAGCTGTCAAGGACCGCACCTTGAGTGACTATGCAGG 
TAACTGGCAATCAGTTTATCCTTTCCTTGAAGACGGCACGTTTGACCAAGTCTTTGACT 

GGTAAGATGACCCAGGCTGAGTACAAGGCTTACTATACAAAAGGCTATCATACAGATGTGACTAAGATTAACATT 

ACTGATAATACTATGGAATTTGTTCAAGGTGGACAAAGCAAGAAATACACITACAAGTATGTCGGTAAGAAAA 

TTGACTTACAAGAAAGGCAATCGTGGCGTGCGTTTCCTCTTTGAAGCCACAGATGCTGACGCTGGA 

ATGTTCAGTTTAGTGACCACAATGTTGCCCCAGTTAAGGCAGAACATTTCCATATCTTCnTTGGAGG 

AGAAGCCCTCTTTGAAGAAATGGACAACTGGCCAACCTACTACCCAGATAACCTATCTGGCCAAGAi^ 

AGAAATGTTGGCGCATTGA 

4170.3 

ATGAAAGATGGTCATTTGCTAGCCCATCATATTCGTTTGTTGAATGGGCGGATTTTTCAAAAGTT 

ATCCTGAGGCTCTTTATAGGGGTGAACAGGGCAAGATTTTAGCGGTTTTATGGAATAGTGAAACTGGCTGCGCAA 

CTGCGACAGATATCGCGCTTGCGACTGGACTTGCGAATAATACGCTGACGACTATGATAAAAAAGCTAGAGGAAC 

AAAAGCTTGTAATTGTTAGTCCGTGTGGAAAAGACAAGCGTAAGAAGTATTTAGTTTTAACGGAG^ 

CCCAGAAAGAAGTGGGGCATCGTGTCAGTCAGAAATTGGATACTATCTTTTACAAAGGATTTTCAGAGGAAGi^^ 

TTCACCAATTTGAAGGTTTTCAAGAAAGAATTTTGGCGAATCTGAAAGAGAAGGGA^ 

4170.4 

ATGACTAATTTAATTGCAACTTTTCAGGATCGTTTTAGTGATTGGTTGACA 
GCn"rri'GACCTTGTTACTAGCrATTTTGCTTGCGATTCCCTTGGCTGl 

ACTGGGTCTTGCAGATTGCAGGTATTTTCCAGACCATCCCGTCTCTGGCCTTGTTGGGGCTC^^ 
GGAATTGGGACCTTGCCGGCTTTGACAGCTCrAGTGATTTATGCGATTTTCCCTAT^^ 

GGCTGAAGGGAATTGATCCGAACCTGCAAGAGGCTGGGATTGCCITTGGGATGACCAGATGGGAACGTCTCA^ 

AATTTGAAATTCCACrCGCCATGCCTGTTATCATGTCTGGGATTCGGACGGCAGCTGTTTTGA^ 

AACCTTGGCGGCCnTGATTGGTGCAGGGGGACTAGGTTCCTTTATTCIT^ 

TTGATTTTGATTGGGGCACTTTCTTCTGCAGTGCTAGCCATTGCCTTTAAC^ 

CAAAATTACGGACGATTTTCTCAGGTTTTGCCITGGTGGCTTTATTACTGGG 

GTTCAAAAAGAGAAGGAAAACTTGGTTATTGCTGGGAAAATAGGTCCAGAACCAGAAATTTTGG 

AAGTTGCTGATTGAAGAAAATACCAGCATGACTGCGACTGTTAAACCGAATTTTGGGAAGACAAGCTT 

AAGCTCTGAAAAAAGGCGATATTGACATCTATCCTGAATTTACTGGTACGGTGACTGAAAGTTTGC^ 

ACCCAAGGTGAGTCATGAACCAGAACAGGTTTATCAGGTGGCGCGTGATGGCATTGCTAAGCAGGATCATCTAGC 

CTATCTCAAACCCATGTCTTATCAAAACACCTATGCTGTAGCTGTTCCGAAAAAGATTGCTCAAGAATATGGC^ 

AAGACCATTTCAGACITGAAAAAAGTGGAAGGGCAGTTGAAGGCAGGTTTTACACrCGAGTTTAACGACCGTGAA 

GATGGAAATAAGGGCTTGCAATCAATGTATGGTCTCAATCTCAATGTAGCGACCATTGAGCCAGCCCTTCGCTATC 

AGGCTATTCAGTCAGGGGATATTCAAATCACGGATGCCTATTCGACTGATGCGGAATTGGAGCGTTATGATTTACA 

GGTCnTGGAAGATGACAAGCAACTCTTCCCACCTTATCAAGGGGCTCCACTCATGAAAGAAGCTCTTCTCA^ 

ACACCCAGAGTTGGAAAGAGTTCTTAATACATTGGCTGGTAAGATTACAGAAAGCCAGATGAGCCAGCTCAACTA 

CCAAGTCGGTGTTGAAGGCAAGTCAGCAAAGCAAGTAGCCAAGGAGTTTCTCCAAGAACAAGGTTTGTTGAAGA^ 

ATGA 
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4170.5 

ATGATGCATACTTATTTGCAAAAGAAAATTGAAAATATCAAAACAACCCTAGGTGAAATGTCAGGTGGTT 
CGTATGGTTGCGGCTATGGCTGATTTAGGATTrrCAGGAACTATGAAGGCTATCTGGGATGACCTCTTTGCCC^ 

GTAGTTTTGCCCAGTGGATTTATTTGCTGGTTTTAGGAAGTTTTCCrCTCT 
5 ATTGTTGACTGGATTGGGATGATTTGTAGCTTGACAGGGATTATCTGTGTAATCTTTGTA TCGGAA GGTCGAGCAA 

GTAATTATCTTTTTGGCTTGATTAACTCTGTTATTTACCTTATT^ 

CTGACGACACTTTACTTCACAGTCATGCAGCCAATTGGACTTCTAGTTTGGATTTATCAGGCACAGT^ 
AAAAGCAGGAGTTTGTCGCGCGTAAACTGGACGGCAAGGGCTGGACAAAGTAT(mTCCATTAGTGTGC^ 
GGTTGGCCTTTGGCTTCATTTATCAGTCTATTGGTGCCAATCGTCCCTATCGTGATTCAATCACAGATC 
10 TGGGGTAGGGCAAATCCTCATGACAGCTGTTTACCGTGAACAGTGGATATTCTGGGCGGCTACCAATGTUilTlCA 
ATCrATCTCTGGTGGGGAGAAAGCCTGCAAATTCAAGGGAAATATCTAATTTATCTCATTAACAGTCT 
GGTATCAATGGAGCAAGGCAGCrAAGCAGAATACTGATTTACTTAACTAG 

4170.6 

1 5 ATGAGAAATATG AAGGCAAAATATGCTGTTTGGGTGGCITTTTTCTT 

TGCAGGTGGAGTATTTGGTTCTAGCGCTGTTCTTGCTGACTCTGTGCATGACTTGGGAGATGCGA^ 

ATATCAGCITTTCrAGAAACAATCTCCAATCGTGAAGAAGACAATCAGTACACCTTGGGC^ 
TGCTAGGAGCCITGGTAACAGCTGTGATTCrrCGTAACGGGCTCTGTTCTAGTCATTTTGGAA^ 
TTTGCATCCGCAACCAGTCAATGATGAGGGGATTCTCTGGTTAGGAATTATTGCGATTACrATCAATCTGTTA 
20 AGTCTGGTGGTTGGTAAGGGAAAGACAAAGAATGAGTCTATTCTGAGTCTGCATTTTCTGGA AGATA CGCTAGGG 
TGGGTAGCTGTTATCCTGATGGCGATTGTTCTTCGATTTACGGACTGGTATATCCTAGATCCTCIT^ 
CATTTCTTTCTTTATTCTTTCAAAAGCCCITCCACGTTT^ 

AAGGTCTTGATATCAAGCAAGTAAAGAGTGGCCTGGAGCGATTGGACAATGTGGCCAGCCTTAATCAGCTTAATC 
TCTGGACTATGGATGCTTTGGAAAAAAATGCCATTGTCCATGTTTGTCTAAAAGAAATGGAACATATGGAAA^ 
25 • TAAAGAGTCTATTCGAATTTTCCTAAAAGATTGTGGTTTTCAAAATATTACCATTG/^ 

ACTCACCAAACCCATAAGCGAAAGGTGTGTGACTTGGAACGGAGTTATGAGCATCAACATTAG 

4170.8 

ATGATTGAATACAAAAATGTAGCACTGCGCTACACAGAAAAGGATGTCTTGAGAGATGTCAACTTACAGATTGAG 
30 GATGGGGAATTTATGGTTTTAGTAGGGCCITCTGGGTCAGGTAAGACGACCATGCTCAAGATGATTAACCGTC^^ 
TGGAACCAACTGATGGAAATATTTATATGGATGGGAAGCGCATCAAAGACTATGATGAGCGTGAACTTCGTCTTT 
CTACTGGTTATGTTTTACAGGCTATTGCTCnTTTTCCAAATCTAACAGT^ 

ATGAAGGGGTGGAGCAAGGAAGAAATTACGAAGAAAACAGAAGAGCTTTTGGCTAAGGTTGGTTTACCAGTAG 
GAGTATGGGCATCGCTTACCTAGTGAATTATCTGGTGGAGAACAGCAACGGGTCGGTATTGTCCGAGCTATGATTG 
35 GTCAGCCCAAGATTTTCCTCATGGATGAACCCTTTTCGGCCTTGGATGCTATTTCGAGAj^^ 

gacaaaagaattgcataaagagtttgggatgacaacgatttttgtaacccatgatacggatgaagcct^ 
ggcggaccgtattgctgtcttgcaggatggagaaattcgccaggtagcgaatcccgagacaattttaaaagcgcc 

TGCAACAGACTTTGTAGCAGACTTGTTTGGAGGTAGTGTTCATGACTAA 
40 4171.1 

ATGTCAGCAGTTGCTATTTCAGCTATGACCAAGGTTATGCAAGAAACCCACGGAAATCCXrCTAGTATTCATGGTC 

ATGGTCGTCAAGCTGGCAAACrcriTGCGAGAAGCCCGTCAGGAACTAGCCCAGTTACTAAGGACAAAACCTC^ 

ATATCTTTTTCACTTCTGGTGGGACTGAAGGCAATAATACTACCATCATTGGCTACTGTC 

AGGAAAACATATCATCACAACTGCCATCGAGCACCATGCTGTCCTTGAAACAATTGATTACTTGGTTCAACACT^ 
45 GGGTTTGAAGCAACCATTATCCAGCCAGAAAATCAAGAAATCACAGCCCAGCAAATTCAAAAGGCTTTACGTGAC 
GATACGATTTTGGTTTCTACCATGTTTGTCAATAATGAGACAGGAAACCTACTGCCCATCGCTGAAATTGGCCAAA 
TACTCAAGCAACACCCTGCTGCCTATCATGTTGATGCAGTTCAGGCTATTGGTAAAATCCCAATTCATTCAGAAGA 
ATTGGGCATTGATTTTCTCACTGCTTCTGCCCACAAATTCCATGGTCCTAAGGGAATCGGTTT^ 
GCATGGACTTTGATTCCTATCTACATGGCGGAGACCAGGAACAGAAAAAACGTGCAGGAACTGAAAATCTGCCTG 
50 CCATTGTAGGCATGGTTGCAGCCCTAAAAGAAGACCTAGAAAAACAAGAAGAACATTTTCAACATGTACAAAA 
TAGAAACTGCCTTTCTGGCAGAGCrGGAGGGCATTCAGTATTACCTGAATAGAGGAAAACATCATCTCCCT^ 
TCTCAATATTGGATTTCCTGGTCAGAAAAATGACCTCTTACTCCTTCGGCTAGATTTAGCTGGAAT^ 
CTGGCTCAGCCTGTACTGCAGGCGTTGTCCAATCCAGCCATGTTCTTGAAGCCATGTATGGCGCAAATTCAGAACG 
CTTGAAGGAATCCCTTCGCATCAGTTTGTCGCCACAAAATACCGTTGAAGACCTACAAACCCTCGCAAAAAC(^ 
55 AAAGAAATTATCGGAGGTTAG 



60 4172.1 

ATGTTATTCAAATTATCTAAGGAAAAAATAGAGCTAGGCTTATCTCGTTTATCGCCAGCCCGTCGTATTT^^ 

GTTTTGCCTTGGTCATTTTACTAGGCTCTCTTCTTTTGAGCnTGCCCT^ 

TATTTTGATCATCTTTTCACTGCTGTCTCTGCAGTCTGTGTGACGGGTCTCTCAACCCnTCCAGTAGCT 
TAATATCTGGGGTCAAATAATCTGTTTGCTCTTGATTCAGATCGGTGGTCTAGGGCTCATGACCTT^ 
65 TCTATATCCAGAGCAAGCAAAAGCTTAGTCTTCGTAGCCGTGCAACTATTCAGGATAGTTTTAGT^ 
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TCGATCirrGAGAAAGTTTGTCTATTCTATTTTTCTCACGACC^^ 
GTTTTCGCCITATTCCTCAACrTGGCTGGGGACGTGGTCTTT^ 
AATGCCGGTTTTGATAATTTAGGGAGCACCAGTTTATTTGCTTTTCAGACCGA^ 
AGGCTTGATTATTACAGGCGGCCTTGGTTTTATGGTCTGGTTTGATTTGGCTGGTCATGTAGG^^ 
5 GGACGTCTGCACITTCATACGAAGCTTGTACTATTATTGACTATAGGTTTGTTGTT^ 

CTTTCTTGAGTGGAACAATGCTGGAACGATTGGCAATCTCCCrGTTGCCGATAAGGTTT^ 
ACAGTGACGATGCGAACAGCTGGCnTTTCTACGATAGATTATACTCAGGCTCATCCTGTGACTCT^ 
TCITACAGATGTTTCTAGGTGGGGCACCTGGAGGAACAGCTGGGGGACTCAAGATTACGACATTTTTTGTCCT 
GGTCTTTGCACGAAGTGAGCTTCTAGGCTTGCCTCATGCCAATGTTGCGAGACGAACGATCGCGCCGCGAACGGTT 

10 CAAAAATCCITTAGTGTCTTTATTATCrTTTTGATGAGC^ 

AGGCAATCCrCCCITTATCCACCTCGTATTTGAAACCATTTCAGCTCTTAGTACAGT^ 
ACTCCTGACCTTGGGAAATTGGCTCTCAGTGTTATCATGCCACTTATGTTTATGGG 

ACGAATTGGTCCCTTGACCTTGTTTGTTAGCTTGGCAGATTACCATCCAGAAAAGAAAGATATGATTCACTA 
AAAGCAGATATTAGTATTGGTTAA 

15 

4172.2 

ATGTCAGATCGTACGATTGGAATTTTGGGCITGGGAATTTTTGGGAGCAGTGTCCTAGCTO 

ATATGAATATTATCGCTATTGATGACCACGCAGAGCGCATCAATCAGTTTGAGCCAGTTTTGGCGCGTGGAGTGAT 
TGGTGACATCACAGATGAAGAATTATTGAGATCAGCAGGGATTGATACCTGCGATACCGTTGTAGTCGCGACAGG 
20 TGAAAATCTGGAGTCGAGTGTGCITGCGGTTATGCACTGTAAGAGTTTGGGGGTACCGACTGTTATTGCTAAGGTC 
AAAAGTCAGACCGCTAAGAAAGTGCTAGAAAAGATTGGAGCTGACTCGGTTATCTCGCCAGAGTATGAAATGGGG 
CAGTCTCTAGCACAGACCATTCnTTTCCATAATAGTGTTGATGTCTTTCAGTTGGATAAAA^ 

AGATGAAAATTCCTCAGTCITGGGCAGGTCAAAGTCTGAGTAAATTAGACCTCCGTGGCAAATACAATCTGAATA 
TTTTGGGTTTCCGAGAGCAGGAAAATTCCCCATTGGATGTTGAATTTGGACCAGATGACCTC^ 
25 CrATATTTTGGCAGTCATCAACAACCAGTATTTGGATACCCTAGTAGCATTGAATTCGTAA 

4172.3 

ATGAAGTTATTGTCTATCGCAATTTCTAGCTATAATGCAGCAGCCTATCTTCATTACTGTGTGGAGTCGCT 
TTGGTGGTGAGCAAGTTGGGATTTTGATTATCAATGACGGGTCTCAGGATCAGACTCAGGAAATCGCTGAGTGTTT 
30 AGCTAGCAAGTATCCTAATATCGTTAGAGCCATCTATCAGGAAAATAAATGCCATGGCGGTGCGGTCAATCGTGG 
CTTGGTAGAGGCn'CTGGGCGCTATTTTAAAGTAGTTGACAGTGATGACTGGGTGGATCCTCG^^ 
ATTCTTGAAACCTTGCAGGAACITGAGAGCAAAGGTCAAGAGGTGGATGTCTTTGTGACCAATT^ 
AGGAAGGGCAGTCrCGTAAGAAGAGTATGAGTTACGATTCAGTCTTGCCTGTTCGGCAGATTTTTGGCTGGGACCA 
GGTCGGAAATTTCTCCAAAGGCCAGTATACCATGATGCACTCGCTGATTTATCGGACAGATTTGTTGCGTGCTAGC 

35 CAGTTCTAA 
4172.4 

ATGAAATTCAATCCAAATCAAAGATATACTCGTTGGTCTATTCGCCGTCTCAGTGTCGGTGTTGCCTCAGTTGTTG 
TGGCTAGTGGCTTCTTTGTCCTAGTTGGTCAGCCAAGTTCTGTACGTGCCGATGGGCTCAATCCAACCCCAGGTCA 
40 AGTCTTACCTGAAGAGACATCGGGAACGAAAGAGGGTGACTTATCAGAAAAACCAGGAGACACCGTTCTCACTCA 
AGCGAAACCTGAGGGCGTTACTGGAAATACGAATTCACITCCGACACCTACAGAAAGAACTGAAGTGAGCGAGGA 
AACAAGCCCITCTAGTCrGGATACACrTTTTGAAAAAGATGAAGAAGCTCAAA^ 

CTTAAAAGAAACTGTAGATACAGCTGATGTGGATGGGACACAAGCAAGTCCAGCAGAAACTACTCCTGAACAAGT 
AAAAGGTGGAGTGAAAGAAAATACAAAAGACAGCATCGATGTTCCTGCTGCTTATCITGAAAAAGCTGAAGGGA^ 

45 AGGTCCTTTCACTGCCGGTGTAAACCAAGTAATTCCTTATGAACTATTCGCTGGTGATG GTA TGTTAACTCGTCTA 
TTACTAAAAGCTTCGGATAATGCrCCTTGGTCTGACAATGGTACTGCTAAAAATCCTGC^^ 
GATTAACAAAAGGGAAATACTTCTATGAAGTAGACnTAAATGGCAATACTGTTGGTAAACA^ 
TTGATCAACTTCGCGCTAATGGTACTCAAACTTATAAAGCTACTGTTAAAGTTTACGGA^ 
TGACTTGACTAATCTAGTTGCTACTAAAAATGTAGACATCAACATCAATGGATTAGTTGCTAAAGAAAC^ 

50 AAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTGAAGGT 
CCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTGATGGCATGTTGACrCGTCTCTTGC 
TCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGCGACGCTAAAAACCCAGCCCTATCTCCACTAGGCGAAA 
ACGTGAAGACCAAAGGTCAATACTTCTATCAAGTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACAAGCGC 
TCATTGACCAGTTCCGAGCAAATGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGACGGTA 

55 AACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTATTAACATAAACGGTTTAATTTCTAAAGA/^ 

TTCAAAAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTG 
AAGGTCCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTGATGGTATGTTGACTCGTCT 
CTTGCTCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGTGACGCTAAAAACCCAGCCCTATCTCCACTAGG 
TGAAAACGTGAAGACCAAAGGTCAATACTTCTATCAATTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACA 

60 AGCGCTCATTGACCAGTTCCGAGCAAACGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGA 
CGGTAAACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTATTAACATAAACGGTTTAATTTCTAAAGA 
AACAGTTCAAAAAGCCGTTGCAGACAACGTTAAGACAGTATCGATGTTCCAGCAGCCTACCTAG 
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ATGAAACTAAAAAGTTATATTTTGGTTGGATATATTATTTCAACCCrCTTAACCAT^^ 
TCAAAAAATGCTGATTGCGAAAGGCGAGATTTACnTTTTGCITGGGATGACCATCGTTGCCAGCC^ 
GGGATTAGTCTCTTTCTCCTATTGCCAGTCTTTACGTCGTTGGGCAAACTCAAGGAGCATGCCAAGCGGGTAGCGG 
CCAAGGATTTTCCTTCAAATTTGGAGGTTCAAGGTCCTGTAGAATTTCAGCAATTAGGGCAA^ 
5 GTCCCATGATTTGCAGGTAAGCTTTGATTCCTTGGAAGAAAGCGAACGAGAAAAGGGCITGATGA^ 

GTCGCATGATATTAAGACTCCTATCACTTCGATCCAAGCGACGGTAGAAGGGATTTTGGATGGGATTATCAAGGA 
GTCGGAGCAAGCrCATTATCTAGCAACCATTGGACGCCAGACGGAGAGGCTCAATAAACTGGTTGAGGAGTTGAA 
TTTTTTGACCCTAAACACAGCTAGAAATCAGGTGGAAACTACCAGTAAAGACAGTATTTT^ 
ATTGAGTGCATGAGTGAATTTCAGTTTTTGATTGAGCAGGAGAGAAGAGATGTCCACTTGCAGGTAATCCC 
10 CTGCCCGGATTGAGGGAGATTATGCTAAGCTTTCTCGTATCTTGGTGAATCTGGTCGATAACGCmTTy^ 

TGCTCCAGGAACCAAGCTGGAAGTGGTGGCTAAGCTGGAGAAGGACCAGCTTTCAATCAGTGTGACCGATGAAGG 
GCAGGGTATTGCCCCAGAGGATTTGGAAAATATTTTCAAACGCCTTTATCGTGTCGAAACTTCGCGTAACA 
ACAGGTGGTCATGGATTAGGACITGCGATTGCGCGTGAATTGGCCCATCAATTGGGTGGGGAAATCACAGTCAGC 
AGCCAGTACGGTCTAGGAAGTACCTTTACCCrCGTTCTCAACCTCTCTGGTAGTGAAAATAAAGCCTAA 

15 

4172,6 

ATGTTTGGTCAAACGGCTCAACATGGTCTTACGAATAGCCTGAAAGACTTCrGGATTTTTCT 
CACAATTGGCGTTTTTTTGCCAGATGCTCCGCTGTTCCAGATCGGTTGAGCAGGGTACTGGAA^ 
GTTCAATATGATTCAGCAGATATTCTCGCATTTTGGGATGACTCACTTGGGACAAATCAAGTTGGTCT 
20 TCGATTGACCrTGAGTTGCTGGTCAATGCACrTAATCATCACTTGCrCATTGACAGA 

AAATAACGATAGAAATCGACAGGCAGATAGTACATGGTCTTGACCTGCTGAAGGGGCGTAAAGACAAAGAGATTA 

TCGACATAAAAAGTATGTTCAGGCAGTTAGAACTGGCTAGCACGCAACAAATCTGTCCGATAAATCAGCGAGTGC 

ATCATGGTATACTGGCCTTTGGAGAAATTTCCGACCrGGTCCCAGCCAAAAATCTGCCGAACAGGCA^ 

25 4174.1 

ATGGAACATTTAGCAACTTATTTTTCAACCTATGGAGGAGCTTTCTTCG 

GATTAAGCGGTATGGGGTCTGCTTATGGAGTTGGTAAGGCTGGGCAATCTGCCGCAGCTTTACTGAAAGAACAGC 
CTGAAAAGTTTGCCTCAGCTTTGATATTGCAATTATTGCCCGGAACACAAGGATTATATGGTT^ 

TTTAATTTGGTTGCAATTAACTCCAGAACITCCTTTAGAAAAAGGCGTTGC^ 
30 CT ATTGTAGGATACnTTTCAGCTAAGCATCAAGGAAATGTAGCAGTAGCGGGAATGCAAATCTTGGCT 

CAAAAGAATTCATGAAGGGAGCAATTTTAGCTGCCATGGTAGAAACCTATGCAATTCTTGCTT^ 
CATTTTGACCCTTCGTGTATAA 
4175.2 

ATGTTAAAATCAGAAAAACAATCACGTTATCAAATGTTAAATGAAGAATTGTCCTTCCTATTGGAAGGCGAAACC 

35 AATGTTTTGGCTAATCTTTCCAACGCCAGTGCTCTCATAAAATCACGTTTTCCTAATACCG 

TTTGTTCGATGGAAAGGAATTGGTTTTAGGCCCCTTCCAAGGAGGTGTTTCCTGCATCCGTATTGCACTAGG 
GGTGTTTGTGGTGAGGCAGCTCACTTTCAGGAAACTGTTATTGTTGGAGATGTGACGACCTATCTCAACTATAT^ 
CTTGTGATAGTCTAGCTAAuAAGTGAAATTGTGGTGCCGATGATGAAGAATGGTCAGTTACTTGGAGTTCTGGATCT 
GGATTCTTCAGAGATTGAGGATTACGATGCTATGGATCGAGATTATTTGGAACAATTTGTCGCTATT^ 

40 AAGACAGCATGGGACTTTACGATGTTTGAGGAAAAATCTTAA 

4175.3 

ATGTCAGTATTAGAGATCAAAGATCITCACGTTGAGATTGAAGGAAAAGAAATTTTAAAAGGGGTTAACCT 
CTGAAAACAGGAGAAATTGCCGCTATCATGGGACCAAATGGTACAGGTAAATCGACTCTTTCTGCCGCTATCATG 

45 GGAAATCCAAACTATGAAGTAACTAAAGGTGAAGTTTTGTITGATGGCGTAAACATCCT^ 

GAGCGTGCGCGTATGGGACTTTTCCTTGCTATGCAATACCCATCAGAAATCCCTGGAATTACCAATGCT 
TTCGTGCCGCTATGAATGCGGGTAAAGAAGATGATGAGAAGATTTCAGTTCGTGAGTTTATTACTAAGCT 
AAAAATGGAATTGCTCAACATGAAAGAAGAAATGGCAGAGCGTTACCTCAACGAAGGCTTCTCTGGTGG 
AAAACGCAATGAAATTCITCAACTTTTGATGTTGGAGCCAACATTTGCTCT^^ 

50 GATATTGACGCTCTTAAAGTTGTGTCTAAAGGTGTCAATGCCATGCGTGGTGAAGGTTTTGGTGCT 

CTCACTACCAACGTCITTTGAACTATATCACACCrGATGTGGTACACGTGATGATGGAAGGTCGTGTTGTCC^ 
TGGTGGTCCAGAATTGGCTGCGCGTTTGGAACGTGAAGGATACGCAAAATTAGCTGAAGAACTTGGCTACGACT 
CAAGGAAGAATTGTAA 

55 4174.4 

ATGCCCTACAAAAGACAAAGGAGTTTTTCAATGGCACTTTCTAAACTAGATAGCCTTTATATGGCAG 
ACCATTCGAAAAATCCACATCACCAAGGGAAGTTAGAAGATGCTGAGCAAATCAGTCTCAACAATCCGACTTGTG 
GGGATGTCATCAACCTCTCTGTCAAGTTTGATGCAGAGGACCGTTTGGAAGATATTGCTTTTCTAAATTCAGGATG 
CACGATTTCAACTGCTTCTGCTAGTATGATGACAGATGCCGTTTTAGGAAAAACCAAACAAGAAATT^ 
60 GCGACTATTTTTTCTGAAATGGTTCAAGGGCAAAAAGATGAGCGTCAAGACCAACTTGGAGACGCGGCATTCTTG 
TCAGGTGTTGCCAAATTCCCTCAAAGAATCAAGTGTGCAACCCTAGCTTGGAATGCCCTTAAGAAAACAATTGAA 
AATCAAGAAAAACAGTAA 
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ATGAAAATTCAAGACCTATTGAGAAAAGATGTCATGTTGCTAGATTTGCAGGCAACTGAAAAAACAGCTGTCATC 
GACGAGATGATTAAAAATTTGACAGACCACGGTTATGTAACAGATTTTGAAACATTTAAAGAAGGAAT^^ 
CGTGAAGCTTTGACTTCTACTGGTTTGGGTGATGGAATCGCAATGCCTCACAGCAAAAACGCTGCT 
GCGACAGTTCTATTTGCTAAGTCAAATAAGGGTGTTGACrACGAGAGCTTGGATGGACAAGCAACrGACCT 
5 TCATGATTGCAGCTCCAGAAGGTGCCAATGATACrCACITGGCAGCCTTGGCAGAATTGTCTCAATACT^ 
AGACGGTTTTGCAGACAAACTTCGTCAAGCAACATCTGCAGACCAAGTTATCGAACTTT^ 
AAAACTGAGGAACITGTTCAAGCACCTGCTAATGACTCTGGTGACTTTATCGTAGCTGTTA^ 
GTATTGCCCACACnTACATGGCCCAAGAAGCCCTTCAAAAAGTAGCrGCTGAAATGGGGGTTGGT^^ 
AAACCAACGGTGCTAGCGGTGTTGGAAATCAACTAACTGCAGAAGATATCCGTAAGGCTAAAGCTATTATCAT^^ 

10 CAGCAGACAAGGCCGTTGAAATGGATCGATTTGATGGAAAACCATTGATCAATCGTCCAGTTGCTGACGGTATCC 
GTAAGACAGAAGAGCTAATTAACTTGGCTCTTTCAGGAGATACTGAAGTCTACCGTGCCGCTAATC 
CTGCAACAGCCTCTAACGAAAAACAAAGCCTTGGTGGTGCCTTGTACAAACACTTGATGAG^ 
TGTTACCATTCGTTATCGGTGGTGGTATCATGATTGCCCTTGCCTTCTTGATTGACGGTGCTTTGGGTGTT^^ 
GAAAACCTTGGCAATCTTGGTTCrrACCATGAGTTAGCTTCTATGTTCATGAAAAT^ 

15 GATGCTTCCAGTCTTTGCGGGTTATGTTGCCTACTCTATTGCTGAAAAACCGGGTTTGGTAGCAGGT^ 
GGTGCTATTGCCAAAGAAGGTTTTGCCnTTGGTAAAATTCCITATGCCGCAGGTG 
CAGGTGTCTCATCTGGTTTCCTAGGTGCCCTTGTTGGTGGATTTATCGCAGGTGCCnTGGTTCT^ 

TACGTTAAAGTTCCTCGTTCACTCGAAGGTGCTAAATCAATCCTTCTATTGCCACn^ 
GATTTGTTATGCTAGCTGTGAATATCCCAATGGCTGCAATCAACACTGCTATGAATGACTTCCTAGGTO 
20 AGGAGGTTCAGCTGTCCTTCITGGTATCGTCCTTGGTGGAATGATGGCrGTTGACATGGGTGGACCA 
GCAGCITATGTCTTTGGTACAGGTACGCTTGCAGCAACTGTTTCTTCAGGTGGT^ 
TGGCTGGAGGAATGGTGCCACCACnTGCAATCTTTGTCGCAACTCnTCnTTTCA^ 

ACGTAACTCTGGTTTGACAAACATCATCATGGGCTTGTCATTTATCACTGAGGGAGCGATTCCATTTGGTG^ 
GACCCAGCTCGTGCGATTCCAAGCTTCATCCTTGGTTCAGCAGTAGCAGGTGGACTCGTTGGTCTTACTGGTAT^ 
25 AACTCATGGCGCCACACGGAGGAATCTTCGTTATCGCCCTTACITCAAATGCTCTCC^ 
GTAGGAGCAATCGTAAGTGGTGTGGTTTATGGTTACCTACGCAAACCACAAGCATAA 

4175,6 

ATGGCAAACAAGAATACAAGTACAACAAGACGGAGACCGTCTAAAGCAGAACTGGAAAGAAAAGAAGCGATTCA 
30 ACGAATGTTGATTTCGTTAGGAATTGCGATTTTATTGATTTTCGCAGCCTTCAAATTAG^ 
CTTTATAATTTAATTCGCTTGCTAGTGGGTAGCCTAGCTTATCTGGCGATATTCGGCCT 
TTTCAAGTGGATACGAAAACAGGAAGGACTCrTATCrGGCTTTTTCACCATAT^ 

AGGCCTACnTGGTTTGGAAATATGGTTTGGACAAGTCCGTTCTAAAAGGGACCATGGCrCAGGTTGTGACAGAT 
GACTGGTTTTCGAACGACTAGCTTTGCTGGAGGGGGCTTGATCGGGGTCGCTCTT^ 

35 TTTTCAAATATCGGAACITACTTTATTGGTTCTATCTTGATTT^ 

TACGATATTGCTGAATTTTTCAGTAGAGGCTTTGCCAAATGGTGGGAAGGGCACGAGCGTCGAAAAGAGG 
TTTGTCAAACAAGAAGAAAAAGCTCGCCAAAAGGCTGAGAAAGAGGCTAGATTAGAACAAGAAGAGACTG 
AGCCTTACTCGATTTGCCTCCTGTTGATATGGAAACGGGTGAAATTCTGACAGAGGAAGCTGTTCAAAATC^ 
CCTATTCCAGAAGAAAAGTGGGTGGAACCAGAAATCATCCTGCCTCAAGCrGAACTTAAATTCCCTG 

40 GATGACTCAGATGACGAAGATGTTCAGGTCGATTITTCAGCCAAAGAAGCCCITGAATACAAAC^ 

CAACTCTTTGCACCAGATAAACCAAAAGATCAGTCTAAAGAGAAGAAAATTGTCAGAGAAAATATCAA^ 
GAAGCAACCnTTGCTAGCTTTGGTATTAAGGTAACAGTTGAACGGGCCGAAATTGGGCCATCAGTGACCAAGTAT 
GAAGTCAAGCCGGCTGTTGGTGTAAGGGTCAACCGCATTTCCAATCTATCAGATGACCTCGCTCTAGCCTT 
CCAAAGATGTCCGGATTGAAGCACCAATCCCTGGGAAATCCCTAATCGGAATTGAAGTGCCCAACT CCGA TATTG 

45 CCACTGTATCITTCCGAGAACTATGGGAACAATCGCAAACGAAAGCAGAAAATTTCnTGGAAAT^ 
AGGCTGTTAATGGAACCGCAAGAGCTTTTGACCTTTCTAAAATGCCCCACT^ 

AGGGAAGTCAGTAGCAGTTAACGGCATTATTGCrAGCATTCTCATGAAGGCGAGACCAGATCAAGTTAAATTTA^ 

GATGGTCGATCCCAAGATGGTTGAGTTATCTGTTTACAATGATATTCCCCACCTCITGATTCCAOT 

CCACGCAAAGCCAGCAAGGCTCTGCAAAAGGTTGTGGATGAAATGGAAAACCGTTATGAACTCn^ 

50 GGAGTTCGGAATATTGCAGGTTTTAATGCCAAGGTAGAAGAGTTCAATTCCCAGTCTGAGTACAAGCAA^ 

CTACCATTCATTGTCGTGATTGTGGATGAGTTGGCTGACCTCATGATGGTGGCCAGCAAGGAAGTGGAAGATGCTA 
TCATCCGTCTTGGGCAGAAGGCGCGTGCTGCAGGTATCCACATGATTCTTGCAACTCAGCGTCCATCTGTTGATGT 
CATCTCTGGTTTGATTAAGGCCAATGTTCCATCTCGTGTAGCATTTGCGGTTTCATCAGGAACAGACTCCCGTACG 
ATTTTGGATGAAAATGGAGCAGAAAAACTTCTTGGTCGAGGAGACATGCTCTTTAAACCGAT^ 

55 CCAGTTCGTCTCCAAGGCTCCTTTATCTCGGATGACGATGTTGAGCGCATTGTGAACITCATCAAGACrCAGGCAG 
ATGCAGACTACGATGAGAGTTTTGATCCAGGTGAGGTTTCTGAAAATGAAGGAGAATTTTCGGATGGAGATGCT 
GTGGTGATCCGCTTTTTGAAGAAGCTAAGTCTTTGGTTATCGAAACACAGAAAGCCAGTGCGTCTATG^ 
TCGTTTATCAGTTGGATTTAACCGTGCGACCCGTCTCATGGAAGAACTGGAGATAGCAGGTGTCATCGGTCCAGCT 
GAAGGTACCAAACCTCGAAAAGTGTTACAACAATAA 

60 

4176.1 

ATGAGTTATTTTAAAAAATATAAATTCGATAAATCCCAGTTCAAACTTGGTATGCGAACCTT^ 
CTGTTTTTCTAGTTCTCTTGATTTTTGGCTTTTTTGGCT 

GCCTGAGGGAGAGTTrrGATGAGAGTGTTCATTTTGGGACTTCGCGTATTCTAGGAAATAGTATCGGTGGACTCTA 
65 TGCCnTGGTCTTCTTCTTATTAAATACCTTTTTCCACGAAGCCTTTTGGGTGACC^ 
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CCATGTTAACCATTATGACAAATGTAGCCATGAATAACAAAGCAGGGGTTATTGGTGGTGTAGCAGCTATGTTAAT 

CATTACCCTATCAATTCCAAGTGGTGAGACAATTTTGTACGTGTTTGTGCGTGTATTAGAAACGTTTATGGGAGTT 

TTTGTCGCAATTATCGTAAATTACGATATTGATCGTATTCGTCTCTTTTTAGAGAAAAAAGAAAAAT>^ 

4178.2 

ATGAATAAATCAGAACACCGCCACCAACTTATACGCGCTCITATCACAAAAAACAAGATTCATACACAGGCTGAG 

TTGCAAGCCCTTCTTGCTGAGAACGACATTCAAGTAACCCAGGCAACCCTCTCACGCGACATCAAAAATATGAAC 

CTATCAAAAGTCCGCGAAGAAGATAGCGCTTATTATGTTCTTAACAATGGTTCCATCTCAAAATGGGAAAAACGTC 

TCGAACTCTACATGGAAGACGCCCTTGTCTGGATGCGCCCAGTTCAACACCAAGTCCTACTAAAAACCCTTCCTGG 

ACTGGCTCAATCCTTTGGTTCTATCATTGATACTTTGAGCTTCCCTGACGCTATCGCTACCCTT^ 

TCTGTCrTATCATCTGTGAAGATGCAGATACTGCTCAAAAGTGCTTTGAAGAACTGAAAAAATTCGCCCCACCAT^ 

TTTCTTTGAAGAATAA 

4179.1 

ATGAAAAGTATAAAATTAAATGCTCTATCITACATGGGAATTCGTGTCITGAATAT^ 

AACCTATGTCGCGCGTGTCTTGGACCGAACTGACTATGGTTACTTCAACTCAGTCGACACTATTTTGTCAT^^ 

TGCCCTTTGCAACnTATGGTGTCrATAACTACGGTTTAAGGGCTATCAGTAATGTCAAGGATi^ 

TAACAGAACCnrrrCTAGTCTTTTTTATTTGTGCATCGC^ 

ATCCTCTCrrCITTACTGATAATCCAATCGTCAAAAAGGTCTACCTTGTTATGGGGATTCAACT 

TTTTCAATCGAATGGGTCAATGAAGCTCTGGAAAATTACAGTTTTCTC^^ 

GATGCTGGTCTCTATTTTCTTATTTGTTAAAAATGAACACGATATTGTTGTCTATACA 

CGCTGATTAACTACCTGATTAGTTATTTTTGGATTAAAAGAGACATCAAACTTGT^ 

TAAACCACTCITTCTCCCTCTGACAGCCATGTTAGTCTTTGCCAATGCCAATATGCTC^ 

TCITCCTCGTTAAAACAGGGATTGATGTCAACGTTAGTTACTATACCATAGCTCAGCGAATTGTGACCGT^ 

TGGGGTTGTAACAGGTGCAATTGGAGTGAGTGTGCCTCGTCTCAGTTACTATCTGGGGAAAGGAGACAAAGAAGC 

CTATGTTTCTCTGGTTAATAGAGGTAGTCGAATCTTTAACTTCnTTATCA^ 

TAGGACCAAATGCCATCCTACTTTACGGTAGTGAAAAATATATCGGAGGCGGCATCITGACCTCTCTCT^ 

TCGTACGATTATCCTGGCCTTAGATACCATTCTTGGTTCCCAAATTCrCTTTACCAATGGCTATGAAAAA^ 

ACAGTCTATACAGTCTTTGCTGGGCTACTCAATTTGGGCTTGAATAGTCTCCTTTTT^ 

TGAATACTACTTACTGACAACTATGCTATCAGAGACTTCTCrACTTGTTTTCTATATCATTT^ 

AACTCATCCACTTGGGACATATCTTTAGCTATACTGTTCGATACTCTCTCITTTCACTT^ 

TCCTGATTAATTTCGTGTATCCTGTAGATATGGTCATTAATTTGCCATTTTTGATTA 

ATACTGGTTTGATTGTCTTGCTATCAGCTATCTCTTATATTAGTCTACTTGTCTTCACAAAAGA^^ 

GAATTTTTAAACCATGTCCTAGCCTTAAAAAATAAATTTAAAAAATCATAG 

4179.2 

ATGAAACAACrAACCGTTGAAGATGCCAAACAAATTGAATTAGAAATTTTGGATTATATTGATACT 

AGCACAATATCAACTATATTATTAACTACGGTACTCTGATTGGGGCGGTTCGACATGAGGGCTTTATCCCTTGGGA 

CGACGATATTGATCTGTCCATGCCTAGAGAAGACTACCAACGATTTATTAACATTTTTCAAAAGGAAAAAAG^ 

GTATAAGCTCCTATCCTTAGAAACTGATAAGAACTACTTTAACAACTTTATCAAGATAACCG^ 

ATTATTGATACTCGAAATACAAAAACCTATGAGTCTGGTATCTTTATCGATATTTTCCCTATAGATCGC^^ 

TCCTAAGGTCATTGATACTTGTTATAAACTGGAAAGCTTCAAACTGCTGTCTTTCAGTAAACATAAAAATA^ 

TATAAGGATAGCCTTTTAAAAGATTGGATACGAACAGCCTTCTGGTTACTCCTTCGACCGGTTTCTCCTCGT^ 

TGCAAATAAAATCGAGAAAGAAATTCAAAAATATAGTCGTGAAAATGGGCAATATATGGCTTT^ 

ATTTAAGGAAAAGGAAGTCTTCCCAAGTGGTACCTTTGATAAAACAATCGATTTACCCT^ 

CCTGCACCTGAAAAATTTGATACTATTTTGACACAATTTTATGGAGATTATATGACCCT 

GCrTCTACAGTCATGAATTTCACGCTTATAAATTGGAGGATTAG 

4179.3 

ATGATAAAAATCAATCATCTAACCATCACACAAAACAAAGATTTACGAGATCTTGTATCTGACCTAACCATGACC 
ATCCAAGACGGGGAAAAGGTTGCTATTATTGGTGAAGAAGGAAATGGCAAATCAACCTTACTTAAAATTT^ 
GGGGAAGCITTGTCTGATTTCACTATCAAGGGAAACATCCAATCTGACTATCAGTCACrGGCCTACATTCCTCy^^ 
AAGTCCCTGAGGACCTAAAAAAGAAAACTTTACACGACTACTTCTTTTTAGATTCT 

CCTCTATCGTTTGGCGGAGGAATTGCATTTTGATAGCAATCGTTTCGCAAGTGACCAAGAGATTGGCAATCTATCA 
GGGGGCGAAGCTTTGAAAATTCAGCITATCCATGAGTTAGCCAAACCCTTTGAGATTCT 

CAAATGACCTAGACCTTGAGACAGTTGATTGGCTAAAAGGCCAGATTCAAAAGACCAGGCAAACCGTTATTTTCA 
TTTCCCATGATGAAGACTTTCITTCrGAAACGGCAGACACrATTGTTCAC^ 

AGCGGAAACGCTAGTAGAGCATTTAGACTATGATAGCTATAGTGAGCAGAGAAAGGCTAATTTTGCCAAACAAAG 

TCAGCAAGCTGCTAACAACCAAAGAGCCTACGATAAAACCATGGAAAAACATCGGAGAGTTAAGCAAAATGTAG 

AAACTGCGCTTCGAGCTACCAAAGATAGTACTGCCGGTCGCCTATTGGCTAAAAAGATGAAAACTGTCCTCTCAC 

AAGAAAAACGCTACGAAAAGGCAGCTCAGTCCATGACTCAAAAGCCACITGAAGAGGAACAAATCCAACTTT^ 

TTTCAGACATCCAACCATTACCAGCTTCTAAAGTCTTAGTCCAACTGGAAAAAGAAAATTTGTCCATTGACGACCG 

AGTTTTGGTTCAAAAACTACAACTAACTGTCCGTGGCCAAGAAAAAATCGGTATTATCGG 

GCCAAATGGTGTTGGGAAATCAACTCTGTTAGCCAAGTTACAGAGACTTCTGAATGATAAAAGAGAGATTTCACT 
TGGTTTTATGCCACAAGATTACCACAAAAAACTGCAATTGGATTTATCCCCAATAGCCTATCTCAGTAAAACTGGG 
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GAAAAAGAGGAACTACAGAAAATCCAATCTCACCrAGCTAGTCrCAATTTCAGTTATCCAGAAATGCAGCATCAA 

ATTCGCTCCTTATCTGGCGGACAACAGGGAAAACTCCTGCITTTGGATTTAGTCCTGCGCy^ 
TGCTGGATGAACCCACACGAAACTTTTCTCCCACTTCTCAACCCCAAATCAGAAAACTCTTTGCTACCTATCCA^ 
CGGTCrCATCACTGTTTCGCATGACCGTCGTTTCTTAAAAGAAGTCTGCTCGATCATCTATCGCATGACAGAACAC 
5 GGTTTGAAGCTAGTTAATTTAGAAGATTTATAA 

4179.4 

ATGAAACCAAAAACATTTTACAACITGCrrGCCGAGCAGAATCTTCCACTTTCGGACCAGCAA^ 
GAACGTTATTTTGAGCTCTTGGTCGAGTGGAATGAGAAGATTAATTTGACGGCGATTACGGACAAGGAAGAAGT^ 

10 TATCTCAAACATTTTTACGATTCGATTGCACCCATTCTTCAAGGTTTGATTCCCAATGAAACTATC^^ 

TATCGGGGCTGGGGCAGGATTTCCTAGTCTACCAATGAAAATTCTCTATCCGGAGTTAGATGTGACCATTATTGAT 
TCACTCAATAAGCGCATCAACTTCCTACAACTCITGGCTCAAGAACTGGATTTGAACGGAGTTCA 
GACGTGCCGAAGATTTTGCCCAAGACAAGAACTTCCGTGCTCAATATGATTTTGTAACAGCTCGTGCGGTTGC^^ 
TATGCAGGTCCTATCTGAATTGACTATTCCCTACCTTAAGGTTGGTGGCAAACTATTAGCACTCAAGGCTAGCAAT 

15 GCGCCTGAGGAATTATTAGAAGCTAAGAATGCCCTCAATCTCCTTTTTAGTAAGGTCGAAGACAATCT 

CCCTACCGAATAGAGATCCGCGCTATATCACAGTGGTAGAAAAGAAAAAAGAAACACCAAATAAATATCCACGTA 
AGGCTGGTATGCCAAATAAACGCCCACTTTAA 

4179.6 

20 ATGAGTATTAAACTAATTGCCGTTGATATCGACGGAACCCTTGTCAACAGCCAAAAGGAAATCACTCCTGAAGT^ 
TTTCTGCCATCCAAGATGCCAAAGAAGCTGGTGTCAAAGTCGTGATTGCAACTGGCCGCCCTATCGCAGGCGTTGC 
CAAACTTCTAGACGACTTGCAGTTGAGAGACGAGGGGGACTATGTGGTAACCTTCAACGGTGCCCITGTCC 
AACTGCTACAGGACATGAGATTATCAGCGAATCCTTGACITATGAGGATTATCTAGATATGGAATTCCTCAGTCGC 
AAGCTCGGTGTCCACATGCATGCCATTACCAAGGACGGTATCTATACTGCAAATCGCAATATCGGAAAATACACT 

25 GTACACGAATCAACCCTCGTCAGCATGCCTATCTTCTACCGTACCCCTGAAGAAATGGCTGG CAAA GAAATTGTTA 
AATGTATGTTTATCGATGAACCAGAAATTCTCGATGCTGCGATTGAAAAAATTCCAGCAGAATTT^ 
CTCCATCAACAAATCTGCTCCTTTCTACCTCGAACrCCTTAAAAAGAATGTAGACAAGGGTTCAGCCAT^ 
TTGGCTGAAAAACTCGGATTGACCAAAGATGAAACCATGGCAATCGGTGATGAAGAAAATGACCGTGCCATGCTG 
GAAGTCGTTGGAAACCCCGTTGTCATGGAAAATGGAAATCCAGAAATCAAAAAAATCGCCAAATACATCACCAAA 

30 ACAAATGACGAATCCGGCGTTGCCCATGCCATCCGAACATGGGTACTGTAA 

4179.7 

ATGACTTGGATTATTCTTGGAGTTATCGCTCTTATTGTTATTTTTGTGATTGTTAGCTATA^ 

TCGTATGCAAACCAAGGAGGCTTGGAGTCAGATTGATGTTCAGTTGAAACGTCGCAATGACCTCTTGCCAAACTTG 
35 ATTGAGACTGTAAAAGGTTATGCCAAATATGAAGGTTCTACCCTTGAAAAGGTGGCAGAACTACGTAACCAAGTG 
GCGGCAGCGACTTCACCAGCAGAAGCTATGAAAGCCAGTGATGCCCTCACTCGTCAGGTTTCAGGTATTTTTGCAG 
TTGCAGAAAGCTATCCAGATTTGAAAGCTAGTGCTAACTTTGTTAAATTGCAAGAGGAGTTGACAAACA 
ATAAAATTTCTTACTCTCGTCAACTCrATAACAGTGTTGTCAGCAACTACAATGTAAAATTAGAAAC^^ 
CAATATTATCGCTGGAATGTTTGGATTTAAAGCGGCAGATTTCCTTCAAACACCTGAAGAGGAAAAGTCGGTTCCT 

40 AAAGTTGATTTTAGCGGTTTAGGTGACTAA 
4179.8 

ATGTTGTTTGATCAAATTGCAAGCAATAAACGAAAAACCTGGATTTTGTTGCTGGTATTTT^ 

TGTTGGTTATGCGGTTGGTTATCTCTTTATAAGATCTGGACTTGGTGGTTTGGTTATTGCACTGATTATCGGCT^ 
45 TCTACGCTTTGTCTATGATTTTTCAATCGACAGAGATTGTCATGTCCATGAATGGAGCGCGTGAGGTG GATGA GCA 
AACGGCACCAGACCTCTACCATGTAGTGGAAGATATGGCTCTGGTCGCTCAGATTCCTATGCCCCGTGTTTTCATC 
ATTGATGATCCAGCCTTAAATGCCTTTGCGACAGGTTCTAATCCTCAAAATGCGGCTGTTGCTGCGAC^ 
TACTAGCTATCATGAATCGTGAAGAACTAGAAGCTGTTATGGGACATGAAGTCAGTCATATTCGTAATTATGATAT 
CCGTATTTCGACTATTGCAGTTGCCCTTGCTAGTGCTATCACCATGCTTTCTAGTATGGCAGGTCGTATGATGTGGT 
50 GGGGTGGAGCAGGTCGCAGACGAAGTGATGATGACCGAGATGGAAATGGTCTTGAAATCATTATGCTAGTGGTTT 
CCCTACTAGCTATTGTACTGGCACCTCTCGCTGCAACCITGGTTCAGCTCGCrATTrCTCGTC^ 
GCAGATGCATCTAGTGTCGAGCTGACTCGCAATCCCCAGGGAATGATTAATGCCCTAGATAAGTTGGACAATAGC 
AAACCTATGAGTCGCCACGTCGATGATGCTAGCAGTGCCCTTTATATCAATGATCCTAAGAAAGGTGGGGGGTTC 
CAAAAACTCITTTATACCCACCCACCTATCTCAGAACGGATTGAACGTTTAAAACAGATC^ 

55 

4179.9 

ATGAAATTAAATATTCAAGAAATTCGTAAGCAGTCTGAAGGTTTGAACTTTGAACAAACGTTAGATTTAGTTGAT^ 
ACCTGCGTGCACGTAATCAAGAAATTTTAGATGTAAAAGATATCCTTGCAGTTGGGAAAGTACAATATGAAGACC 
GTATGTATTTCTTAGATTATCAACTATCTTATACCATTGTTCTTGCTTCGAGTCGCAGTATGGAGCCAGTTGAGT^ 
60 GTTGAATCTTATCCAGTCACGGAAGTTTTCATGGAAGGCGCAACTAACCAGCTAGATCAAGAAGTTTTAGATGATG 
ACITGGTCTTGCCCATCGAAAATGGGGAGCTTGACCTTGCTGAGAGTGTATCAGACAATATCCTGCTAAACATTCC 
TATCAAGGTCTTGACGGCTGAAGAAGAAGCTGGTCAAGGATTTATCTCAGGAAATGACTGGCAAATCATGACAGA 
GGAAGAATACCAAGCTCAAAAAGCAGTAAAGAAAGAAGAAAACAGTCCTTTTGCTGGCTTACAAGGACTATTTGA 
CGGAGATGAATAA 

65 
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4179.12 

ATGGAGTTATTTATGAAAATCACAAACTATGAAATCTATAAGTTAAAAAAATCAGGTTTGACCAATCAACAGATTT 
TGAAAGTGCTAGAATACGGTGAAAATGTTGATCAGGAGCTTTTGTTGGGTGATATTGCAGATATCTCAGGTT^ 
TAATCCAGCCGTTTTTATGGAACGTTATTTTCAGATAGACGATGCGCATTTGTCGAAAGAGTTT 
5 TCTTTCTCTATTTTAGATGACTGTTATCCTTGGGATTTGAGTGAAATATATGATGCGCCTGTACrm 

GGGAAATCTTGACCrCCTGAAATTCCCGAAGGTAGCGGTCGTGGGCAGTCGTGCTTGTAGCAAACAGGGAGCTAA 
GTCAGTTGAAAAAGTCATTCAAGGCTTGGAAAATGAACTGGTTATTGTCAGTGGTCTGGCCAAGGGCATT GACAC 
AGCAGCTCATATGGCAGCTCTTCAGAATGGCGGAAAAACCATTGCAGTGATTGGAACAGGACTGGATGTGTTTTA 
TCCTAAAGCCAATAAACGCITGCAAGACTACATCGGCAATGACCATCTGGTTCTAAGTGAATATGGACCTGGTGA 
10 ACAACCTCTGAAATTTCATTTTCCTGCCCGTAATCGCATCATTGCTGGACTTTGTCGTGGTGTGATTGTAGCAGA 

GCTAAGATGCGTTCAGGTAGTCTCATTACGTGTGAGCGAGCAATGGAAGAAGGACGCGATGTCITTGCTATTCCTG 
GTAGCATTTTAGATGGACTATCAGACGGTTGCCATCATTTGATTCAAGAAGGAGCAAAATTGGTCACCAGTGGG 

AAGATGTTCTTGCGGAATTTGAATTTTAA 

15 4181.1 

ATGAAACGTCAATTAGCCnTGGTCGTCrrrAGTGGTGGTCAAGATTCAACAACCTGCC^^ 

ACTATGAAACAGTCGAAGCTGTCACCTTTGCCTACGGCCAACGTCATCACCTCGAAATTCAAATTACTAGAGAAAT 
CGCTAAGGAACAGGGCATTCGTCACCATATCCTCGATATGTCTCTGCTGGGACAAATCACTGCTCAGCCAGACTT^ 
GCGACGATTCATATTTCCTACATTCCTGACAAGCTCTGTGTCGAGTCAAAATCCCTCAAACTATATCTATTTAGCT 
20 ACCGAAACCACGGAGATTTCCACGAAAACTGTATCAACACCATCGGGAAAGACTTGGTCAACTTGCTAGACCCTC 
GCTATTTAGAAGTCTGGGGAAAATTCACrCCGCGCGGTGGCATTTCAATCGACCCCTACTACAACTACGGTAA 
AGGAACTAAGTATGAGGGCITGGCAGAACAACGCCTCTTCCAACACGACCITT 
CTAA 

25 4181.2 

ATGACCGAAACGGTAGAAGATAAAGTAAGTCATTCAATTACTGGGCTTGATATCCTCAAGGGGATAGTTGCTGCG 
GGAGCTGTCATAAGTGGAACCGTTGCAACTCAAACGAAGGTATTTACAAATGAGTCAGCAGTACITGAAAAAAC^ 
GTAGAGAAAACGGATGCTTTGGCAACAAATGATACAGTAGTTCTAGGTACGATATCTACAAGTAATTCAGCGAGT 
TCAACTAGTTTGTCAGCTTCAGAGTCGGCAAGTACATCTGCATCTGAGTCAGCCrCAACCAGCGCTTCGACCTC^ 
30 CAAGTACAAGTGCATCAGAATCAGCAAGTACATCGGCTTCGACAAGTATTTCTGCATCATCTACTGTGGTAGGTTC 
ACAAACAGCTGCCGCTACAGAAGCAACTGCTAAGAAGGTCGAAGAAGATCGTAAGAAACCAGCTAGTGATTATGT 
AGCATCAGTTACAAATGTCAATCTCCAATCTTATGCTAAGCGACGCAAGCGTTCAGTGGATTCCATCGAGCAATTG 
CTGGCTTCTATAAAAAATGCTGCTGTTTTTTCTGGCAATACGATTGTAAATGGCGCCCCT 

TAAACATTGCTAAAAGTGAGACAAAAGTTTATACAGGTGAAGGTGTAGATTCGGTATATCGTGTTCCAATTTAC^ 
35 TAAATTGAAAGTGACAAATGATGGTTCAAAATTGACCTTTACCTATACGGTTACGTATGTGAATCCTAAAACAAA^ 
GATCTTGGTAATATATCAAGTATGCGTCCTGGATATTCTATCTATAATTCAGGTACTTCAACACAAACAATGTTAA 
CCCTTGGCAGTGATCTTGGTAAACCTTCAGGTGTAAAGAACTACATTACTGACAAAAATGGTAGACAGGTT CT 
CTATAATACATCTACAATGACGACGCAGGGTAGTGGGTATACTTGGGGAAATGGTGCCCAAATGAATGGTTTCTTT 

GCTAAGAAAGGATATGGATTAACATCATCTTGGACTGTACCAATTACTGGAACGGA 

40 TACATCCTTTACATTTACCCCTTACGCTGCTAGAACAGATAGAATTGGAATTAACTACITCAATGGTG^ 

GTAGTTGAATCTAGCACGACCAGTCAGTCACnTTCACAGTCTAAGTCACTCTCAGTAAGTGCTAGTCAAAGCGCCT 
CAGCTTCAGCATCAACAAGTGCGTCGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCAACCAGTGCGTCAG 
CTTCAGCAAGTACCAGTGCTTCAGTCTCAGCATCAACAAGTGCirCAGCCrCAGCATCGACAAGTGCCTCGGC^ 
AGCAAGCACATCAGCATCTGAATCAGCGTCAACCAGTGCTTCGGCTTCAGCAAGTACCAGTGCTTCAGCTTCAGC 

45 ATCAACCAGCGCCTCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCCTCGGCCTCAGCAAG 
CACCTCAGCTTCrGAATCGGCCrCAACCAGCGCCTCAGCCrCAGCATCAACGAGTGCTTCGGC^ 
AAGCGCCTCGGGTTCAGCATCAACGAGTACGTCAGCTTCAGCGTCAACCAGTGCTTCAGCCTCAGCATCAACAAG 
TGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCTGAGTCAGCATCAACGAGTAC 
GTCAGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGTGCGTCAGCCTCAGCATCGACAAGCGCCTC 

50 AGCTTCAGCAAGTACCAGTGCTTCAGCCTCAGCGTCGACAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGC 
ATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCA 
ACCAGTGCGTCAGCTTCAGCAAGTACCAGTGCTTCAGTCTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACA 
AGTGCCTCGGCTTCAGCAAGCACATCAGCATCTGAATCAGCGTCGACAAGCGCCTCAGCTTCAGCAAGTACCAGT 
GCGTCAGCTTCAGCATCAACCAGCGCCTCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCC 

55 TCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCCTCAGCCTCAGCATCAACGAGTGCTTCG 
GCTTCAGCAAGCACAAGCGCCTCGGGTTCAGCATCAACGAGTACGTCAGCTTCAGCGTCAACCAGTGCTTCAGCC 
TCAGCATCAACAAGTGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCTGAGTCA 
GCATCAACGAGTACGTCAGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGTGCGTCAGCCTCAGCA 
TCGACAAGCGCCTCAGCTTCAGCAAGTACCAGTGCTTCAGCCTCAGCTCGACAAGTGCGTCGGCCTCAACCAGTG 

60 CATCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCAT 
CGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCAACCAGTGCGTCAGCTTCAGCAAGTACCAGTGCTTCAGT 
CTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACAAGTGCcTCGGCTTCAGCAAGCACATCAGCATCTGAATCA 
GCGTCGACAAGCGCcTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGACAAGTGCGTCAGCCTCAGCA 
AGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCATCGGCTTCGGCGTCAACCAGTGCATCAGAGTCAGCAAGT 

65 ACCAGTGCGTCAGCTTCCGCATCAACAAGTGCCTCGGCTTCAGCAAGCACCAGTGCGTCGGCTTCAGCAAGTACT 
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AGCGCCTCAGCCTCAGCCTCAACCAGTGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGT 
GCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCGGCTTCAGCGTCAACGAGTGCG 
TCTGAATCGGCATCAACGAGTGCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCG 
GCTTCAGCATCAACGAGTGCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCGGCTT 
5 CAGCGTCAACGAGTGCGTCTGAGTCAGCATCAACGAGTGCGTCAGCCTCAGCAAGCACATCAGCTTCTGAATCTG 
CATCAACCAGTGGGTCAGCCTCAGCATCGACAAGCGCCTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGT 
CGACAAGTGCGTCGGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGA 
CAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCAT 
CAGCTTCAGCATCAACGAGTGCATCGGCTTCAGCATCAACCAGTGCATCAGAGTCAGCAAGTACCAGTGCGTCAG 

1 0 TTCCGCATCAACAAGTGCCTCGGCTTCAGC AAGTACTAG 

4183.1 

ATGGGGGTCGAAACTTGGTTTTATTCTAGCATCTGCTGGCTGGCCATCGGGCTTGGTTCCGTTTGGAAGTTTC^ 

ACATGACTGCTGCTAATGGCGGTGGAGGCTTTTTACTAATCTTTCTCATTTCCACTATTT^ 
15 CTGCTGGCTGAGTTTGCCCTTGGCCGTAGTGCTGGCGTTTCCGCTATCAAAACCTTTGGAAAACTGGGC^ 
ACAAGTACAACTTTATCGGTTGGATTGGCGCCTTTGCCCTCTTTATCCTCTTATC^^ 

TGGATTCTAGTCTATCTAGGTATTGAGTTTGGGAAATTGTTCCAACTTGGTGGAACGGGTGATTATGCTCAGT^^ 
TTACTTCAATCATTTCAAATCCAGCCATTGCCCTAGGAGCTCAAGCGGCCTTTATCC TATT GA^^ 
TCACGTGGGGTTCAAAAAGGGATTGAAAGAGCTTCGAAAGTCATGATGCCCCTGCTCTTTATCC^ 
20 TCATCGGTCGCTCTCTCAGTTTGCCAAATGCCATGGAAGGGGTTCTTTACTTCCTCA/^ 
GACTAGCACTGGTCTCCTCTATGCTCTGGGACAATCTTTCnTTGCCCTCTCACr^ 
ATGCnrCTTACITAGACAAGAAAACCAATCTAGTCCAGTCAGGAATCTCCATCGTAGCCATG 

CATCATGGCAGGTCTAGCCATTTTCCAAGCTCGATCCCCCTTCAATATCCAGTCTGAAGGGGGACCCAGCCTGCTC 

TTTATCGTCTTGCCTCAACTCTTTGACAAGATGCCTTTTGGAACCATTTTCT 
25 TTTGCGACAGTCACTTTTTCTGTCGTGATGCTGGAAATCAATGTAGACAATATCACCAACCAGGATAACAGCAA^ 

GTGCCAAATGGAGTGTTATTTTAGGAATTTTGACCTTTGTCTTTGGCATTCCT^ 

GCGGATGTTCACATTTTTGGTAAGACCTTCTTTGACGCTATGGACTTCTTGGTTTCC 

AGCTCTCTACCTTTCACTTTTTACAGGCTATATCTTTAAAAAGGCTCTTGCAA 

AGAGCATGGAAACAAGGACTGTTCCAAGTCTGGCTCTTCCTTCTTCGTTTCTTCGTTTCGTCATTCCAATCATC 
30 ATTGTGGTCTTCATTGCCCAATTTATGTAATCAAAAAGGACTTGAGTAG 

4183.5 

ATGTTGAAAAAATGGCAGTTAAAAGATGTTATCITGCTTGCTTTCTTGTCTATCIT^ 

TTCAGGATATGTGTATAATATTCTCAGTCTACTCTTAACACCTCTTGGTTTGCAGGCCTTTGCCAATGAAATCCTCT 
35 TCGGTCTCTGGTGTATGGCTGCGCCCATTGCTGCCATCTTTGTTCCGAGAGTCGGAAGTGCAACGATTGGAGAAGT 
GCTAGCTGCGCTTGCTGAAGTCCTTTATGGTAGCCAATTTGGTCTAGGAGCTCTTTTGTCTGGCT^ 
TGGGAAGTGAATTTGGTTTTATCGTAACTAAGAATCGCTATGAAAGTTGGCTCTCTCTAACTGCTAAT 
GATTACGCTTGTTAGCTTTGTCTATGAATACATTAAGTTAGGTTACTACGCCTTTTCCCTTCCGT^ 
GCTTGTGGTACGTTTTATTTCTGTTTATTTCTTCTGTACCATCTTGGTTCGTGCCATT^^ 
40 TGCAACTGGAGGAAAAGCATAG 

4183.6 

ATGGTCAAAGTAGCAACCCAGACACCGATTATCAGTCTCTTCTTGCTGATTTTATCCTTGGAAACATCTTTCA^^ 
TTCGATTGCTCTGACTCTTTCGGTAGTCGCATTTTGTATTCTCTTTATGCTCTATTACCGTCGAT^ 
45 CTTGGATGATCATACnTGCCATTTTACCATCTTTTGCCAACTACTGGGCAGTTCAGTTACA 

GGCAGTCATGCTTGGAACGAGGGCCTTTGTGACAGTTTGTATCGGCCTTGTCTTTGTTTCCTCT 
AGCnrCrCITGTACTTGGCTCAAAAGGGGCTATCACGCTCTTGGTCCTATGCCTTGATTG 

CCTCTCATTCAGCAAGAAATCAAGTCCCTCAAAGAAGCTTGCCTATTACGTGGTCAAGAACTACAri l"i iGGTCGC 
CCrTGATTTACAGTAAGGTTCTGATGACAGTCTTTAGGTGGCGCCATCTTTACCTGAGAG 
50 ATATGACGAACATGCACAGTTGAAGAATAGCTATCGGACTTTTTATATTCCTAAAAAAACAAAATTAATCT 

CTTTTCTTTTTATTGCITCAAACCAGTCrATTTT^ 
4183.7 

ATGAGAAAGCACCAATTACAAGTTCACAAATTAACCATTTTATCTATGATGATTGCCCTTGATGTAGTCCTTACAC 
55 CTATCTTTCGAATTGAGGGAATGGCACCGATGTCCAGTGTAGTCAATATTCTAGCAGGAATCATGATGGGACCTGT 
TTATGCCTTGGCTATGGCTACAGTCACAGCCTTTATCCGTATGACGACTCAAGGGATTCCGCCTTTAGCT 
GGAGCGACTTTTGGAGCCCTTCTAGCAGGTCTCTTTTATAAGTACGGTCGAAAATTTCACTATTCTGCTCTA 
AGATTTTGGGAACAGGTATTATTGGTTCCATTGTTTCCTATCCTGTTATGGTACTCTTTACAGGATCAGCTGCTAAG 
CTTAGCTGGTTTATCTACACGCCTCGATTTTTCGGAGCAACCTTGATTGGTACAGCGATTTCCT^ 
60 ATTTTTAATCAAGCAGGAATTCTTTAAAAAAGTGCAGGGATATTTCTTTAGTGAAAGGATAGACTGA 

4183.8 

ATGCAGGAATTTACAAATCCCTTTCCTATAGGCTCTAGTTCCCTCATTCACTGCATTACCAATGAGATTTCTTGTGA 
GATGCTGGCAAATGGGATTTTGGCrCTGGGATGCAAACCTGTCATGGCAGATGATTCCCGTGAAGTTCTTGATTTT 
65 ACTAAGCAAAGTCAGGCTCTCTTCATCAATTTGGGGCATTTGTCAGCTGAGAAGGAAAAAGCAATCCGCATGGCA 
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GCTTCGTATGCAAACCAATCTTCTCTCCCGATGGTAGTAGATGCGGTTGGCGTAACGACTTCATCCATTCGTAAGA 

GCTTAGTTAAAGACCTTTTAGACTATAGACCTACGGTCCTTAAAGGAAACATGTCAGAAATTCGAAGTCTTGT^ 

ATTAAAGCACCACGGCGTTGGGGTCGATGCGAGTGCTAAAGATCAAGAAACGGAGGATTTGCTTCAAGTCTTGAA 

AGACTGGTGTCAGACCTATCCTGGTATGTCTTTCTTAGTCACAGGTCCCAAGGACCTCGTCGTTTCGAAAAATCAG 

GTCGCTGTACTGGGAAATGGCTGTACTGAATTAGACTGGATAACAGGGACAGGAGACTTGGTTGGAGCCTTAACA 

GCTGTTTTTCTCAGCCAAGGAAAGACTGGTTTTGAAGCTTCTTGCTTAGCAGTCrCTTATCT 

GAAAATAGTTGTTCAAGGAATGGGATTGGAAGAATTTCGTTACCAAGTACTCAATCAGCITTCGCTCCT 

AGATGAAAATTGGCTAGATACCATCAAAGGAGAGGTTTATGAATAG 



4185.3 

ATGAACCATAAAATCGCAATTTTATCAGATGTTCATGGCAATGCGACGGCGCTAGAAGCAGTGATTGCAGATGCT 

AAAAATCAAGGGGCCAGTGAATATTGGCTTCTGGGAGATATTTTTCITCCrGGTCCAGGCG 

CCCTGCTAAAGGACCITCCTATCACAGCAAGTGTTCGAGGCAATTGGGATGATCGTGTCCITGAGGC^^ 

GCAATATGGCnTAGAAGACCCACAGGAAGTTCAGCTCTTGCGTATGACACAGTATTTGATGGA GCGAA TGGATCC 

TGCAACGATTGTCTGGCTACGAAGCITGCCTTTGCTGGAAAAGAAAGAAATTGACGGATTGCGC^^ 

CATAATTTACCTGACAAAAACTATGGTGGTGACTTGCTAGTTGAGAATGATACAGAGAAATTTGACCAACT 

GATGCGGAAACGGACGTGGCAGTTTATGGTCATGTTCACAAGCAGTTGCTTCGTTATGGAAGTCAAGGGCAACAA 

ATCATCAATCCAGGGTCGATTGGCATGCCCTATTTTAATTGGGAGGCGTTAAAAAATCACCGTTCCCAGTATG^ 

TGATAGAAGTTGAAGATGGGGAATTACTCAATATCCAATTTCGTAAAGTTGCTTATGATTACGAAGCTGAG^ 

ATTGGCCAAGTCCAAGGGGCTTCCCTTTATCGAAATGTATGAAGAACTGCGTCGTGACGATAACTATCAGGGGCA 

CAATCTGGAATTATTAGCCAGCnTAATAGAAAAGCATGGGTATGTAGAGGATGTGAAGAATTTTTTTGAT^^ 

TAA 

4186.1 

ATGAATGTAAATCAGATTGTACGGATTATTCCTACTTTAAAAGCTAATAATAGAAAATTAAATGAAACATT^ 

TTGAAACCCTTGGAATGAAGGCCTTGTTAGAAGAATCGGCCTTTCTGTCACTAGGTGACCAAACGGGTCTTGA^^ 

GCTGGTTTTAGAAGAAGCTCCCAGTATGCGTACTCGTAAGGTAGAGGGAAGAAAAAAACTAGCTAGATTGATTGT 

CAAGGTGGAAAATCCCTTAGAAATTGAAGGAATCTTATCTAAAACAGATTCGATTCATCGATTATATAAAGGTCA 

AAATGGCTACGCTTTTGAAATTTTCTCACCAGAAGATGATTTGATTTTG 

CTAGTAGAAGTAGGAGAAAAGCCTGAATTTCAAACAGATTTGGCATCAATTTCTTTAAGTA^ 

TGGAATTACATCTCCCAACTGATATCGAAAGTTTCTTGGAATCATCTGAAATTGGGGCATCCCITGAT^^ 

AGCTCAGGGGCAGGATTTGACTGTGGACAATACGGTTACCTGGGACTTATCTATGCTCAAGTTCTTGGTCAATGAA 

TTAGACATAGCAAGTCITCGCCAGAAGTTTGAGTCrACTGAATATTTTATTCCTAAGTCTGAAA^ 

GTAAAGATAGAAATAATGTTGAATTGTGGTTTGAAGAAGTATGA 

4186.2 

ATGAAGTGGACCAAGATTATTAAAAAAATAGAAGAACAAATCGAGGCAGGGATTTATCCCGGAGCCTCTTTT 

TATTTTAAGGACAATCAATGGACAGAGTTCTATTTAGGCCAGAGTGACCCAGAGCATGGCTTGCAGACTGAGGC^ 

GGACTAGTTTATGACCTAGCTAGTGTCAGCAAGGTTGTTGGGGTTGGCACAGTTTGTACCnTCITGTGG 

GTCAATTAGATATTGATAGACTGGTAATAGATTTTTTACCTGAGAGTGATTATCCAGACATCACTATTCGCCAG 

CTTGACTCATGCAACAGAC(nTGATCCTTTTATTCCTAATCGTGATCnTTTAACAGCCCCTGAA^ 

ATGTTTCATCTCAACAGACGAAGTCAGCCAGCCTTTCTTTATTCGGATGTCCATTTT^ 

GGAAAGAATTTTTAATCAAGATTTGGATGTGATTTTAAAGGATCAAGTCTGGAAACC^ 

TAAGTTTGGGCCAGTTGAGCTTGCTGTTCCAACAGTTAGAGGTGTAGAGGCAGGCATAGTGCATGATCCCAAGGC 

TCGTCTCCTGGGTAGACATGCTGGGAGTGCTGGTTTATTTTCGACTATAAAGGATTTACAAATCT^^ 

TATTTAGCAGATGATTTTGCAAGAGACnTAAATCAAAATTTTTCTCCTTTGGATC 

GGAATTTGGAAGGAGATTGGCTAGACCATACGGGCTATACAGGTACCTTTATCATGTGGAATCGTCAGAAGCAAG 
AAGCCACTATTTTCCTATCGAATCGTACCTATGAAAAGGACGAGAGAGCTCAATGGATATTAGACCGCAATCAAG 
TGATGAACTTGATTCGCAAAGAAGAGTAA 

4187.2 

ATGATGAAGAAGACTTATAATCATATTTTGGTCTGGGGAGTCATTTTCTATAGCATTTGCATTGTCTGTTT^ 

TACTCCTCAAGAACAATCTACCGTGGGAGTGGGAACTCCAGGTATTCAGCATCTTGGACGCCTGGTTTTTCTTTTG 

ACTCCTTTCAATTCrCTCTGGAAACTGGGCGAAGTGAGTGACATTGGACAATTATGTTGGATTTT^^ 

GTCCTTCTTTTTAGTTTTCTTGTGAGTCTTGGAATCGAGTGTACGCAATTAATCTTGGACT^^ 

CGCGTCTTTGAGATTGATGATTTGTGGACCAACACTTTGGGTGGCTATCTGGCTTGGCTCCTTTATAAACGATTAC 

ATAAAAACAAGGTAAGGAATTAA 
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4188.1 

ATGAAGATTCCTCTCTTAACITTTGCAAGGCATAAATTTGTTTATGTC^ 

TATCGTGATGTTTTGATGACTTATTTCTTTTTTGATATTCATGCGCCCGATCTAGCTAAATTCGATG 
TAAAAATGACTTATTAAAATCAGCATTAGATTTTCGTATTCTCCAGTTCAATCTAGGTT^ 
5 TTCCAATCATCATTGTTTTGCTAGGTTTTCAATATATTGAGCTGAAAAATAAAGTTTTACGATTGAGTA^^ 
AGAAGTGAGTTATCAAGGGTTAAAAAGAAAGTTGACITTGCAAGTTGCAAGTATCCCTTGTTTG 
ACTGTGCTGATAATTGCAATTATAACCTATTTCTTTGGGACm^ 

GGAAGTGGTTTACAAAGACTCCTAGATGGAGAGATAAAAAGCTATTTGTTCTTTACTTGTGTCCT 
TTTTCATCAATGCAATCTATTTTTTACAAATAGTTGATTATGTGGGGAATGTGACT^ 
10 ATGTTTCTTTGGCTTGGTTCTATGCTGCTTTATAGTGCCTTGCCnTACTATATGG^ 

AGCTAGCTATGGGGATGTAAGTTTGATGAAACTCnTTACTCCITATATCCTTTATATTGTCC^ 

AAAAATATGAAGATAATGTTTAA 
4188.2 

15 ATGAAGATAATGTTrAAGAATTTTAACAATATTTTGCTAAATAGAAAGAT^^ 

GATGATTTTGATAAACCATCTATTGTCAACAGCGGrrCAAAAGCAGGATGCTGTTATCrrilTCAAGAGAG 

ATTTCAATTTTTTCCTATAATGACTATTCTGAAGCGAATTTAGAAATCCCCA^ 

CATGGTAGGATGGCTCTCTGTCATTTTACTTGAAAGTGATTTGGCAGACCATTACCATCACTTGATTCGCTATC^ 
TCAAGCTCCTTTTTCGATTATACAAGGAAACGATTGGTTGTCATTTCTAAAT^^ 
20 GTTTCnTGGTTTACITCCTCTAGGAATTCATTTCAAAACAGTCGCACT^ 

GTACTTACTACTGTCTTATCTGATAGCACTGATTAGTGCGGGCGCTGGTTTTTCCT^^ 

TGTGGGACAAGAATGGATGATGGATCATATTGTAACAGTGTATTTAGTACTCTTAAGTTTATTAGTTATGTTGATT 
GTTAGTCGCTTGGAAGAGAAATTTAAGAAAGGATAA 

25 4188.5 

ATGGGCAAAGGAGAGATGGGCAAAGGAGTTATTGGCTTGGAGTTCGACTCAGAAGTATTGGTCAACAAGGCTCCA 
ACCCITCAATTGGCAAATGGTAAAACAGCGACTTTCCTAACCCAGTATGATAGCAAGACCTTGTTGT^ 
ATAAGGAAGATATCGGACAGGAAATTATTGGTATAGCTAAAGGAAGCATCGAAAGTATGCATAATCTTCCTGTAA 
ATCTAGCAGGTGCCAGAGTTCCTGGCGGAGTAAATGGTAGCAAAGCAGCGGTGCATGAAGTTCCAGAATTTACAG 
30 GGGGAGTTAATGGTACAGAGCCAGCTGTTCATGAAATCGCAGAGTATAAGGGATCTGATTCGCTTGTAACTCTTAC 
TACAAAAAAAGATTATACm^ACAAAGCTCCTCITGCTCAGCAGGCACTTCCTGAAACAGGAAACA^ 
CCTCCTAGCTTCACTAGGACTAACAGCTTTCnTCCTTGGTCTGTTTACGCTAGGGAA^ 

4188.10 

35 ATGTTTAAAGTTTTACAAAAAGTTGGAAAAGCTTTTATGTTACCTATAGCTATACTTCCT 

GGGGATTGGTGGTGCACnTTCAAACCCAACCACGATAGCAACTTATCCAATACTAGACAATAGTAi 1 iiiCAATCA 
ATATTCCAAGTAATGAGCTCTGCAGGAGAGGTTGTATTCAGTAATTTGTCACTACTTCTCTGTGTGGGATTAT^ 
TTGGCTTAGCGAAACGAGATAAAGGAACCGCTGCGTTAGCAGGAGTAACTGGTTACTTAGTTATGACTGCAACG 
TCAAAGCTTTGGTAAAACnTTTTATGGCAGAAGGATCTGCAATTGATACTGGAGT^ 
40 AATAGTTGCCGTATATTTGCACAACCGATATAACAATATTCAATTACCITCCGCTTTAGGATT 
CGCTTCGTTCCTATTGTTACATCGTTCTCnTCTATCTTGATTGGCTTTGTC^ 
CAACTTCITGTTTCTACAGGTGGATATATTTCTCAGGCGGGTCCAATTGGAACTT^ 
ACTTTCTGGAGCAGTAGGCTTACATCATATAATTTACCCTATGTTTTGGTATACTG^ 
TTGCAGGACAAACAGTGGTTGGAGCrCAAAAAATATTTTTTGCTCAATTAGCCGATTTGGCCCATTCT 

45 TACAGAAGGAACAAGGTTTTTTGCAGGTCGTTTCTCAACAATGATGTTCGGTTTACCG 

TACCATAGTGTTCCTAAAAATCGTCGTAAAAAATACGCGGGTTTGTTTTTTGGAGTTGCTT^ 
CGGTATTACAGAACCAATTGAATTTATGTTTCTATTCGTCAGTCCGGTTCTATATGTTGTTCACGCATTC^ 
GTGTTAGCITCnTTATTGCAGACGTCTTAAATATTTCAATAGGAAACACATTTTCAGGAGGTGT^ 
TTATTTGGAATTTTGCAGGGGAACGCTAAGACGAATTGGGTTCTTCAGATTCCATTTGGACTTAT^ 

50 GTATTATATTATTTTTAGATGGTTCATTACTCAATTCAACGTTCTAACGCCAGGGCGAGGAGAAGAAGTAGATTCT 
AAAGAAATTTCTGA^TCCGCAGATTCAACTTCAAATACTGCAGATTATTTAAAACAGGATAGCCTACAAATTA^ 
GAGCCTTGGGTGGATCAAATAATATAGAAGATGTAGATGCITGTGTGACACGTTTACGTGTAGCTGTAAAAGAAG 
TTAATCAAGTTGATAAAGCACTTTTAAAACAAATTGGTGCAGTTGATGTCTTAGAAGTGAAGGGTGGCATTCAAGC 
AATCTATGGAGCAAAAGCAATCTTATATAAAAATAGTATTAATGAAATTTTAGGTGTAGATGATTAA 

55 

4188.11 

ATGAAATTTAGAAAATTAGCTTGTACAGTACTTGCGGGTGCTGCGGTTCTTGGTCITGCTGCTTGTGGCAA^ 

GCGGAAGTAAAGATGCTGCCAAATCAGGTGGTGACGGTGCCAAAACAGAAATCACTTGGTGGGCATTCCCAGTAT 

TTACCCAAGAAAAAACTGGTGACGGTGTTGGAACTTATGAAAAATCAATCATCGAAGCGTTTGAAAAAGCAAACC 

60 CAGATATAAAAGTGAAATTGGAAACCATCGACTTCAAGTCAGGTCCTGAAAAAATCACAACAGCCATCGAAGCAG 
GAACAGCTCCAGACGTACTCTTTGATGCACCAGGACGTATCATCCAATACGGTAAAAACGGTAAATTGGCTGAGT 
TGAATGACCTCTTCACAGATGAATTTGTTAAAGATGTCAACAATGAAAACATCGTACAAGCAAGTAAAGCTGGAG 
ACAAGGCTTATATGTATCCGATTAGTTCTGCCCCATTCTACATGGCAATGAACAAGAAAATGTTAGAAGATGCTGG 
AGTAGCAAACCTTGTAAAAGAAGGTTGGACAACTGATGATTTTGAAAAAGTATTGAAAGCACITAAAGACA^ 

65 TTACACACCAGGTTCATTGTTCAGTTCTGGTCAAGGGGGAGACCAAGGAACACGTGCCTTTATCTCTAACCTTTAT 
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AGCGGTTCTGTAACAGATGAAAAAGTTAGCAAATATACAACTGATGATCCTAAATTCGTCAAAGGTCTTGAAAAA 
GCAACTAGCTGGATTAAAGACAATTTGATCAATAATGGTTCACAATTTGACGGTGGGGCAGATATCCAAAAC^ 
GCCAACGGTCAAACATCTTACACAATCCTTTGGGCACCAGCTCAAAATGGTATCCAAGCTAAACTTTTAGAAGCA 
AGTAAGGTAGAAGTGGTAGAAGTACCATTCCCATCAGACGAAGGTAAGCCAGCTCTTGAGTACCTTGTAAACGGG 
5 TTTGCAGTATTCAACAATAAAGACGACAAGAAAGTCGCTGCATCTAAGAAATTCATCCAGTTTATCGCAGATGAC 
AAGGAGTGGGGACCTAAAGACGTAGTTCGTACAGGTGCTTTCCCAGTCCGTACITCATTTGGAAAACTT^ 
ACAAACGCATGGAAACAATCAGCGGCTGGACTCAATACTACTCACCATACTACAACACTATTGATG GATT TGCTG 
AAATGAGAACACITTGGTTCCCAATGTTGCAATCTGTATCAAATGGTGACGAAAAACCAGCAGATGCrrTT 
CCTTCACTGAAAAAGCGAACGAAACAATCAAAAAAGCTATGAAACAATAG 

10 

4188.12 

ATGCAATCTACAGAAAAAAAACCATTAACAGCCTTTACrOTTATTTCAACAATCATTT^ 
TGTTCATCTTTCCATTCTACTGGATTTTGACAGGGGCATTCAAATCACAACCTGATACAATTGTTATTCCT 
TGGTTCCCTAAAATGCCAACCATGGAAAACTTCCAACAACrCATGGTGCAGAACCCTGCCTTGCAATGGAT^ 
15 AACTCAGTATTTATCTCATTGGTAACCATGTTCTTAGTTTGTGCAACCTCATCTCTAGCAGGTTAT^ 
AAAACGTTTCTATGGTCAACGCATTCTATTTGCTATCTTTATCGCrGCTATGGCGC^ 
TACCATTGGTACGTATCGTCAACTTCATGGGAATCCATGATACTCTCTGGGCAGTTATCITGCCn^ 
GCCATTCGGTGTCTTCCTCATGAAACAGTTCAGTGAAAATATCCCTACAGAGTTGCnTGAATCAGCrAA^ 
GGTTGTGGTGAGATTCGTACCTTCTGGAGTGTAGCCTTCCCGATTGTGAAACCAGGGTTTGCAGCCCTTGCA^ 

20 TTACCTTCATCAATACTTGGAATGACTACTTCATGCAATTGGTAATGTTGACTTCAC^ 

CnTGGGGTTGCGACCATGCAGGCTGAAATGGCAACCAACTATGGTTTGATTATGGCAGGAGCTGCCCTTGCTGCT 

TTCCAATCGTCACAGTCTTCCTAGTCITCCAAAAATCCTTCACACAGGGTATTACTATGGGAGCGGTCAAAGGA 

A 

25 4191.1 

ATGAAAAAAACTTTTTTCTTACTGGTGTTAGGCTTGTTTTGCCTTCTTCCACTCTCT 

ATAAACTCTTATCAAGGGGATTTGTATATTCATGCAGACAATACGGCAGAGTTTAGACAGAAGATAGTTTACCAGT 

TTGAGGAGGACmTAAGGGCCAAATCGTGGGACTTGGACGTGCTGGTAAGATGCCTAGCGGGTTTGACATTGACC 

CTCATCCAAAGATTCAGGCCGCGAAAAACGGTGCAGAACTAGCAGATGTGACTAGCGAAGTAACAGAAGAAGCG 

30 GATGGTTATACTGTGAGAGTCTATAATCCAGGTCAGGAGGGCGACATAGTTGAAGTTGACCTCGTCTGGAACTTA 
AAAAATTTACITTTCCTTTATGATGATATCGCTGAATTAAATTGGCAACCTCTGACAGATAGT^ 
AAAAGTTTGAATTTCATGTAAGGGGAGACAAGGGGGCTGAAAAACTCTTTTTCCATACAGGGAAACm 
AGGGAACGATTGAAAAGAGTAACCTTGATTATACTATCCGTTTAGACAATCITCCGGCTAAGCGTGGAGTTGAG 
GCATGCCTATTGGCCTCGGACCGATTTTGCTAGCGCTAGGGATCAGGGATTGAAAGGGAATCGTTTAGAAGAGTTT 

35 AATAAGATAGAAGACTCGATTGTTAGAGAAAAAGATCAGAGTAAACAACTCGTTACTTGGGTCCTCCCITCGA 
CTTTCCATCTCCTTGTTATTGAGTGTCTGCTTCTATTTTATTTATAGAAGAAAGACCACTCCTT 
CAAAAATCATCGTCTCTATGAACCACCAATGGAATTAGAGCCTATGGTTTTATCAGAAGCAGTCTACTCGACCTCC 
TTGGAGGAAGTGAGTCCCTTGGTCAAGGGAGCTGGAAAATTCACCTTTGATCAACITATTCAAGCTACCTTG 
ATGTGATAGACCGTGGGAATGTCTCTATCATTTCAGAAGGAGATGCAGTTGGTTTGAGGCTA GTAA AAGAAGATG 

40 GTTTGTCAAGCTTTGAGAAAGACTGCCTAAATCTAGCTTTTTCAGGTAAAA^ 

TGCGGATTACAAGGTATCTGATAGTCTTTATCGTAGAGCCAAAGTTTCTGATGAAAAACGGATTCAAGCAAGAGG 
GCTTCAACTCAAATCTTCTTTTGAAGAGGTATTGAACCAGATGCAAGAAGGAGTGAGAAAACGAGT^ 
GGGCrCCCAGATTATTATCGTCCTTTAACTGGTGGGGAAAAGGCCTTGCAAGTGGGTATGGGTGCC^ 
TGCCCCrATTTATCGGATTTGGTTTGTTCTTGTACAGTTTAGACGTTCATGGCTATCTT^ 

45 CITGGTTTTCTAGGGTTAGTTTTGTCTGTTTTCTATTATTGGAAGC^ 

TGAAGCGGGAGCTGAGGTCTACTATCTCTGGACCAGTTTTGAAAATATGTTGCGTGAGATTGCACGATTGGATCAG 
GCTGAACTGGAAAGTATTGTGGTCTGGAATCGCCTCTTGGTCTATGCGACCTTATTTGGCTATGCGGACAAGGTTA 
GTCATTTGATGAAGGTTCATCAGATTCAAGTGGAAAATCCAGATATCAATCTCTATGTAGCTTATGGCTGGCACAG 
TACGTTTTATCATTCAACAGCACAAATGAGCCATTATGCTAGTGTCGCAAATACAGCAAGCACCTACTCTGTATCT 

50 TCTGGAAGTGGAAGTTCTGGTGGTGGCTTCTCTGGAGGCGGAGGTGGCGGCAGTATCGGTGCCTTTTAA 

4191.2 

ATGAAAAAAGTAAGAAAGATATTTCAGAAGGCAGTTGCAGGACTGTGCTGTATATCTCAGTTGACAGCTTTTTCn^ 
CGATAGTTGCTTTAGCAGAAACGCCTGAAACCAGTCCAGCGATAGGAAAAGTAGTGATTAAGGAGACAGGCGAAG 

55 GAGGAGCGCTTCTAGGAGATGCCGTCTTTGAGTTGAAAAACAATACGGATGGCACAACTGTTTCGCAAAGGACAG 
AGGCGCAAACAGGAGAAGCGATATTTTCAAACATAAAACCTGGGACATACACCTTGACAGAAGCCCAACCTCCAG 
TTGGTTATAAACCCTCTACTAAACAATGGACTGTTGAAGTTGAGAAGAATGGTCGGACGACTGTCCAAGGTGAAC 
AGGTAGAAAATCGAGAAGAGGCTCTATCTGACCAGTATCCACAAACAGGGACTTATCCAGATGTTCAAACACCTT 
ATCAGATTATTAAGGTAGATGGTTCGGAAAAAAACGGACAGCACAAGGCGTTGAATCCGAATCCATATGAACGTG 

60 TGATTCCAGAAGGTACACTTTCAAAGAGAATTTATCAAGTGAATAATTTGGATGATAACCAATATGGAATCGAATT 
GACGGTTAGTGGGAAAACAGTGTATGAACAAAAAGATAAGTCTGTGCCGCTGGATGTCGTTATCTTGCTCGATAA 
CTCAAATAGTATGAGTAACATTCGAAACAAGAATGCTCGACGTGCGGAAAGAGCTGGTGAGGCGACACGTTCTCT 
TATTGATAAAATTACATCTGATTCAGAAAATAGGGTAGCGCTTGTGACTTATGCTTCCACTATCTTTGATGGGACC 
GAGTTTACAGTAGAAAAAGGGGTAGCAGATAAAAACGGAAAGCGATTGAATGATTCTCTTTTTTGGAATTATGAT 

65 CAGACGAGTTTTACAACCAATACCAAAGATTATAGTTATTTAAAGCTGACTAATGATAAGAATGACATTGTAGAAT 
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TAAAAAATAAGGTACCTACCGAGGCAGAAGACCATGATGGAAATAGATTGATGTACCAATTCGGTGCCACTTTTA 
CTCAGAAAGCTTTGATGAAGGCAGATGAGATTTTGACACAACAAGCGAGACAAAATAGTCAAAAAGTCATTTTCC 
ATATTACGGATGGTGTCCCAACTATGTCGTATCCGATTAATTTTAATCATGCTACGTTTGCTCCATCATATCAAAAT 
CAACTAAATGCATTTTTTAGTAAATCTCCTAATAAAGATGGAATACTATTAAGTGAT^^ 
5 GTGGAGAACATACAATTGTACGCGGAGATGGGCAAAGTTACCAGATGTTTACAGATAAGACAGTTTATGAAAAAG 
GTGCTCCTGCAGCTTTCCCAGTTAAACCTGAAAAATATTCTGAAATGAAGGCGGCTGGTTATGCAGTTATAGGCGA 
TCCAATTAATGGTGGATATATTTGGCTTAATTGGAGAGAGAGTATTCTGGCTTATCCGTTTAATTCTAATACT 
AAATTACCAATCATGGTGACCCTACAAGATGGTACTATAACGGGAATATTGCTCCTGATGGGTATGATGTCTTTAC 
GGTAGGTATTGGTATTAACGGAGATCCrGGTACGGATGAAGCAACGGCTACTAGTTTTATGCAAAGTATTTCTAGT 

10 AAACCTGAAAACTATACCAATGTTACTGACACGACAAAAATATTGGAACAGTTGAATCGTTATTTCCACACCATC 
GTAACTGAAAAGAAATCAATTGAGAATGGTACGATTACAGATCCGATGGGTGAGTTAATTGATTTGCAATTGGGC 
ACAGATGGAAGATTTGATCCAGCAGATTACACTTTAACTGCAAACGATGGTAGTCGCTTGGAGAATGGACAAGCT 
GTAGGTGGTCCACAAAATGATGGTGGTTTGTTAAAAAATGCAAAAGTGCTCTATGATACGACTGAGAAAAGGATT 
CGTGTAACAGGTCTGTACCTTGGAACGGATGAAAAAGTTACGTTGACCTACAATGTTCGTTTGAATGATGAGTTTG 

15 TAAGCAATAAATTTTATGATACCAATGGTCGAACAACCTTACATCCTAAGGAAGTAGAACAGAACACAGTGCGCG 
ACTTCCCGATTCCTAAGATTCGTGATGTGCGGAAGTATCCAGAAATCACAATTTCAAAA^ 
ACATTGAGTTTATTAAGGTCAATAAAAATGATAAAAAACCACTGAGAGGTGCGGTCITTAGTC^ 
ATCCGGATTATCCAGATATTTATGGAGCTATTGATCAAAATGGCACrrATCAAAATGTGAGAACAGGTGAAGATG 
GTAAGTTGACCTTTAAAAATCTGTCAGATGGGAAATATCGATTATTTGAAAATTCTGAACCAGCTGGT^ 

20 CGTTCAAAATAAGCCTATCGTTGCCrrCCAAATAGTAAATGGAGAAGTCAGAGATGTGACr^ 

GATATACCAGCGGGTTACGAGTTTACGAATGATAAGCACTATATTACCAATGAACCTATTCCTCCAAAGAGAGAA 
TATCCTCGAACTGGTGGTATCGGAATGTTGCCATTCTATCTGATAGGTTGCATGATGATGGGAGGAGTTCTATTAT 
ACACACGGAAACATCCGTAA 

25 

4191.3 

ATGAAATCAATCAACAAATTTTTAACAATGCTTGCTGCCTTATTACTGACAGCGAGTAGCCrGTT^ 
CAGTTTTTGCGGCTGGGACGACAACAACATCTGTTACCGTTCATAAACTATTGGCAACAGATGGGGATATGGATA 
30 AAATTGCAAATGAGTTAGAAACAGGTAACTATGCTGGTAATAAAGTGGGTGTTCTACCTGCAAATGCAAAAGAAA 
TTGCCGGTGTTATGTTCGTTTGGACAAATACTAATAATGAAATTATTGATGAAAATGGCCAAACTCTAGGAGTGAA 
TATTGATCCACAAACATTTAAACTCTCAGGGGCAATGCCGGCAACTGCAATGAAAAAATTAACAGAAGCTGAAGG 
AGCTAAATTTAACACGGCAAATTTACCAGCTGCTAAGTATAAAATTTATGAAAT^ 

GGTGAAGATGGAGCAACCTTAACAGGTTCTAAAGCAGTTCCAATTGAAATTGAATTACCATTGAACGATGTTGTG 

35 GATGCGCATGTGTATCCAAAAAATACAGAAGCAAAGCCAAAAA.TTGATAAAGATTTCAAAGGTAAAGCAAATCCA 
GATACACCACGTGTAGATAAAGATACACCTGTGAACCACCAAGTTGGAGATGTTGTAGAGTACGAAATTGTTACA 
AAAATTCCAGCACTTGCTAATTATGCAACAGCAAACTGGAGCGATAGAATGACTGAAGGTTTGGCATTCAACAAA 
GGTACAGTGAAAGTAACTGTTGATGATGTTGCACTTGAAGCAGGTGATTATGCTCTAACAGAAGTAGCAACTGGTT 
TTGATTTGAAATTAACAGATGCTGGTTTAGCTAAAGTGAATGACCAAAACGCTGAAAAAACTGTGAAAATCACT^ 

40 ATTCGGCAACATTGAATGACAAAGCAATTGTAGAAGTACCAGAATCTAATGATGTAACATTTAACTATGGTAATA 
ATCCAGATCACGGGAATACTCCAAAGCCGAATAAGCCAAATGAAAACGGCGATTTGACATTGACCAAGACATGGG 
TTGATGCTACAGGTGCACCAATTCCGGCTGGAGCTGAAGCAACGTTCGATTTGGTTAATGCTCAGACTGGTAAAGT 
TGTACAAACTGTAACTTTGACAACAGACAAAAATACAGTTACTGTTAACGGATTGGATAAAAATACAGAATA 
ATTCGTTGAACGTAGTATAAAAGGGTATTCAGCAGATTATCAAGAAATCACTACAGCTGGAGAAATTGCTGTCAA 

45 GAACTGGAAAGACGAAAATCCAAAACCACTTGATCCAACAGAGCCAAAAGTTGTTACATATGGTAAAAAGT^ 
CAAAGTTAATGATAAAGATAATCGTTTAGCTGGGGCAGAATTTGTAATTGCAAATGCTGATAATGCT 
TTAGCACGTAAAGCAGATAAAGTGAGTCAAGAAGAGAAGCAGTTGGTTGTTACAACAAAGGATGCTTTAGATAGA 
GCAGTTGCTGCTTATAACGCTCTTACTGCACAACAACAAACTCAGCAAGAAAAAGAGAAAGTTGACA^ 
GCTGCTTATAATGCTGCTGTGATTGCTGCCAACAATGCATTTGAATGGGTGGCAGATAAGGACAATGAAAATGTTG 

50 TGAAATTAGTTTCTGATGCACAAGGTCGCTTTGAAATTACAGGCCTTCTTGCAGGTACATATTACTTAGAAGAAAC 
AAAACAGCCTGCTGGTTATGCATTACTAACTAGCCGTCAGAAATTTGAAGTCACTGCAACTTCnTATTCAGCGACT 
GGACAAGGCATTGAGTATACTGCTGGTTCAGGTAAAGATGACGCTACAAAAGTAGTCAACAAAAAAATCACTATC 
CCACAAACGGGTGGTATTGGTACAATTATCTTTGCTGTAGCGGGGGCTGCGATTATGGGTATTGCAGTGTACGCAT 
ATGTTAAAAACAACAAAGATGAGGATCAACTTGCTTAA 

55 

4191.4 

ATGACAATGCAGAAAATGCAGAAAATGATTAGTCGTATCTTCTTTGTTATGGCTCTGTGTTTTTCTCTTGTATGGGG 

TGCACATGCAGTCCAAGCGCAAGAAGATCACACGTTGGTCTTGCAATTGGAGAACTATCAGGAGGTGGTTAGTCA 

ATTGCCATCTCGTGATGGTCATCGGTTGCAAGTATGGAAGTTGGATGATTCGTATTCCTATGATGATCGGGTGCAA 

60 ATTGTAAGAGACTTGCATTCGTGGGATGAGAATAAACTTTCTTCTTTCAAAAAGACTTCGTTTGAGATGACCTTCC 
TTGAGAATCAGATTGAAGTATCTCATATTCCAAATGGTCTTTACTATGTTCGCTCTATTATCCAGACGGATGCGGT 
TTCTTATCCAGCTGAATTTCTTTTTGAAATGACAGATCAAACGGTAGAGCCTTTGGTCATTGTAGCGAAAAAA 
GATACAATGACAACAAAGGTGAAGCTGATAAAGGTGGATCAAGACCACAATCGCTTGGAGGGTGTCGGCTTTAAA 
TTGGTATCAGTAGCAAGAGATGTTTCTGAAAAAGAGGTTCCCTTGATTGGAGAATACCGTTACAGTTCTTCTGGTC 

65 AAGTAGGGAGAACTCTCTATACTGATAAAAATGGAGAGATTTTTGTGACAAATCTTCCTCTTGGGAACTATCGTTT 
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CAAGGAGGTGGAGCCACTGGCAGGCTATGCTGTTACGACGCTGGATACGGATGTCCAGCTGGTAGATCATCAGCT 

GGTGACGATTACGGTTGTCAATCAGAAATTACCACGTGGCAATGTTGACTTTATGAAGGTGGATGGTCGGACCAA 

TACCTCTCTTCAAGGGGCAATGTTCAAAGTCATGAAAGAAGAAAGCGGACACTATACTCCTGTTCTTCAAAATGGT 

AAGGAAGTAGTTGTAACATCAGGGAAAGATGGTCGTTTCCGAGTGGAAGGTCTAGAGTATGGGACATACTATTTA 

TGGGAGCTCCAAGCTCCAACTGGTTATGTTCAATTAACATCGCCTGTTTCCTTTACAATCGGGAAAGATACTCGTA 

AGGAACTGGTAACAGTGGTTAAAAATAACAAGCGACCACGGATTGATGTGCCAGATACAGGGGAAGAAACCCTTG 

TATATCTTGATGCrrGTTGCCATTTTGTTGTTTGGTAG 

4191,5 

ATGAGCCACATATACTTATCTATTTTCACAAGTCTCTTGCTGATGCTAGGACTTGTCAATGTTGCTCAAGCCGATG 

AATATTTACGCATCGGTATGGAAGCAGCATATGCTCCCTTTAACTGGACCCAGGATGATGATAGCAACGGAGCTG 

TCAAAATCGATGGGACCAATCAGTATGCCAACGGATACGATGTTCAAATCGCCAAGAAAATCGCTAAGGACTTAG 

GTAAAGAACCTTTGGTTGTTAAAACCAAGTGGGAAGGTCTAGTCCCTGCCCTTACTTCTGGT^ 

TATCGCAGGTATGAGTCCAACTGCAGAACGCAAACAAGAAATTGCCTTTTCGAGCAGTTACTATACTAGCGAACC 

AGTTTTGCTTGTCAAAAAAGATTCTGCCTACGCAAGTGCTAAATCTTTGGATGACT^ 

TCTCAACAAGGGGTCTACCTTTATAACTTGATTGCACAAATCCCAGGTGCTAAAAAAGAAACAGCCATGGGAGAC 

TTCGCTCAAATGCGACAAGCTCTTGAGGCTGGTGTCATTGATGCTTATGTTTCTGAACGTCCAGAAGCACTGACTG 

CTGAAGCrGCGAACTCTAAGTTCAAGATGATTCAAGTAGAACCTGGTTTCAAAACTGGGGAA 

TCGCTATCGGGCTTCGTAAAAATGACAATCGTATTAGCCAAATCAATGCCAGCATTGAAACCATTTCAAAAGATG 

ACCAAGTTGCCTTGATGGATCGTATGATCAAGGAACAACCTGCCGAAGCTACAACAACTGAAGAGACTA GCAGTA 

GTTTCITTAGCCAAGTTGCTAAAATTCnTTCTGAAAACTGGCAA^ 

ATCTCTATCGTCGGAACCATCATAGGTCTCATTATTGGACITGCCATTGGTGTCnTCCGTACTGCTCCT 

AAACAAAGTCATTTACGGCCTACAAAAACTAGTCGGCTGGGTTCTCAATGTCTACATTGAAAT^ 

CCAATGATTGTTCAATCGATGGTTATCTACTATGGAACTGCCCAAGCTTTCGGGATCAACC 

TTGACCGTACACTGGCTGCTATCTTCATCGTTTCAATCAATACCGGTGCCTACATGACTGAAATCGTCCGTGGTGG 
TATCCTAGCAGTTGACAAGGGACAATTTGAAGCTGCGACTGCTCrTGGTATGACCCATAACCAGACCATGCGTAA 
GATTGTCCTACCrCAGGTAGTCCGTAACATCCTACCTGCAACTGGTAATGAATTTGTCATCAATATCAi^ 
TCTGTATTGAACGTTATCTCTGTTGTCGAACrTTATTTCTCAGGAAATACCGTG^ 

CCAGACATTTACAATCATCGCCGTGATTTACTTTGTCCTCACCTTCACCGTAACACGTATCCTACGCTTTATCG^^ 
GCAGAATGGACATGGATACCTACACTACAGGTGCTAACCAAATGCAAACGGAGGATTTGAAATAA 

4191.6 

ATGACACAAGCAATCCTTGAAATTAAACACCTCAAAAAATCCTATGGACAAAACGAAGTGCTAAAAGACAT^ 

CTCACTGTCCACAAGGGAGAGGTCATCTCTATCATCGGAAGCTCTGGAAGCGGAAAATCGACCTTCCTACGCTCC 

ATTAACCTACITGAAACACCAACTGATGGACAAATCCTTTATCATGGACAAAACGTCCTCGAAAAAGGCTATGAC 

CTCACGCAATACCGTGAAAAGTTGGGGATGGTTTTCCAATCCTTTAACCTCTTTGAAAATCTCAATGTTC^ 

ACACAATCGTCGCTCAGACAACTGTCCTAAAACGCGAACGCACAGAAGCTGAAAAGATTGCCAAAGAAAACCTG 

GAAAAGGTCGGCATGGGAGAACGCTACTGGCAAGCCAAACCAAAACAACTCTCAGGTGGTCAAAAACAACGTGT 

GGCCATCGCTCGTGCCCTCTCCATGAATCCGGACGCTATTCTCTTTGATGAACCAACATCAGCTCTCGATCCAGAA 

ATGGTTGGAGAAGTCCTCAAAATCATGCAGGACCTGGCTCAGGAAGGCTTGACCATGATTGTCGTAACCCATGAA 

ATGGAATTTGCCCGTGATGTCTCTCACCGTGTTATCTTTATGGATAAGGGCGTGATCGCTGAAGAAGGTAAACCAG 

AAGACCTCrTCACCAATCCTAAAGAAGACCGAACAAAAGAGTTCCTTCAACGCTATCTCAAATAA 

4192.3 

ATGAAAAAGTATCAACnTCTATTCAAAATAAGTGCAGTCTTCTCTTACTTATTT^ 

GACGCTTATCGTCCAAAACTATTGGCAATTTTCTTCTCAGATAGGCAATTTATTCT 

TACTTTTTATTGGAGTCATGATTGTGGTTCTTGTTAAGACAGGCCATGGTTATCTCn^ 

ATGGCnTTGGTATTCGATTTTGACAGTATTAGTGCTAGTGTTCCAGATCTCTTTTAAC^ 

TTCAGTCAACTGCGGAAGGTTGGGCTGTATTGATTGGTTATAGTGGGACTAAtnTTGCAGAGCTAGGTAT^ 

AGCCCTGTTCTTTCTGGTTCCACTGATGGAAGAATTGATTTATAGAGGATTACTGCAACATGO^ 

CGCGATTTGGTCTTGATTTGCTTCTTCCnTCTATTTrATTTGCT 

ATATCITCGTCTTTGCAACAGTTGGAATCATCITTGCrGGTTTGACCCGCT 

GCGGTGCATGTGATCAATAATATTGTAGCGACCTTCCCGTTTTTGCTCACTTTTCTACATAGGGTC^ 
4193.1 

ATGAACAAGAAACAATGGCTAGGTCTTGGCCTAGTTGCAGTGGCAGCAGTTGGACTTGCTGCATGTGGTAACCGC 

TCITCTCGTAACGCAGCTTCATCTTCTGATGTGAAGACAAAAGCAGCAATCGTCACTGATACTGGTGGTGTTGATG 

ACAAATCATTCAACCAATCAGCTTGGGAAGGTTTGCAGGCTTGGGGTAAAGAACACAATCrTTCAAAAGATAACG 

GTTTCACTTACTTCCAATCAACAAGTGAAGCTGACTACGCTAACAACTTGCAACAAGCGGCTGGAAGTTACAACCT 

AATCTTCGGTGTTGGTTTTGCCCTTAATAATGCAGTTAAAGATGCAGCAAAAGAACACACTGACTTGAACTATGTC 

TTGATTGATGATGTGATTAAAGACCAAAAGAATGTTGCGAGCGTAACTTTCGCTGATAATGAGTCAGGTTACcTTG 

CAGGTGTGGCTGCAGCAAAAACAACTAAGACAAAACAAGTTGGTTTTGTAGGTGGTATCGAATCTGAAGTTATCT 

CTCGTTTTGAAGCAGGATTCAAGGCTGGTGTTGCGTCAGTAGACCCATCTATCAAAGTCCAAGTTGACTACGCTGG 

TTCATTTGGTGATGCGGCTAAAGGTAAAACAATTGCAGCCGCACAATACGCAGCCGGTGCAGATATTGTTTACCA 

AGTAGCTGGTGGTACAGGTGCAGGTGTCTTTGCAGAGGCAAAATCTCTCAACGAAAGCCGTCCTGAAAATGAAAA 
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AGTTTGGGTTATCGGTGTTGATCGTGACCAAGAAGCAGAAGGTAAATACACTTCTAAAGATGGCAAAGAATCAAA 
CTTTGTTCTTGTATCTACTTTGAAACAAGTTGGTACAACTGTAAAAGATATTTCTAACAAGGCAGAAAGAGGAGAA 
TTCCCTGGCGGTCAAGTGATCGTTTACTCATTGAAGGATAAAGGGGTTGACTTGGCAGTAACAAACCTTTCAGAAG 
AAGGTAAAAAAGCTGTCGAAGATGCAAAAGCTAAAATCCTTGATGGAAGCGTAAAAGT 
5 TCCTGAAAAATAA 

4193.3 

ATGTCTAAAAAATTACAACAAATTTCGGTTCCCTTGATTTCTGTATTCCTAGGAATTTTACTCGGAGC 
GTGGATCTTCGGTTATGATGCTATTTGGGGCTACGAAGAATTGTTCTATACAGCCTTTGGCAGTCTGCGTGGGAT^ 

10 GGAGAAATCTTCCGTGCTATGGGTCCTCTGGTCTTGATTGGTCTTGGTTTTGCCGTTGCCAGTCGAGCTGGT^ 

TAACGTCGGACTTCCTGGTCAGGCTTTGGCAGGTTGGATTCTCAGTGGTTGGTTTGCCCTGTCGCATCCAGATATG 
CCCCGTCCCTTGATGATTCTAGCAACCATCGTGATTGCCTTGATTGCTGGTGGGATTGTCGGAGCGATTCCAGGTA 
TTCTTAGGGCCTATCTAGGGACGTCAGAGGTTATTGTAACCATCATGATGAACTACATTGTCTTGTATGTAGGGAA 
TGCCTTTATCCATGCTTTCCCTAAAGACTTCATGCAAAGTACAGATTCGACCATTCGTGTTGGGGCT 

15 TATCAGACACCTTGGTTGGCTGAGTTGACTGGTAACrCACGGATGAATATTGGTATTTTCTTTGCCATCATTGC^ 
TGCAGTTATTTGGTTCATGCTCAAGAAAACAACTCTTGGTTTTGAAATCCGTGCAGTTGGT^^ 
CAGAATATGCTGGTATTTCTGCCAAGCGGACTATTATCCTATCTATGATTATTTCAGGTGCCITGGCAGGT(^ 
GGAGCTGTTGAAGGTTTGGGAACCTTCCAGAACGTCTATGTTCAAGGTTCGTCATTAGCTATCGGATTTAACGG/^ 
TGGCGGTTAGTTTGCTTGCGGCCAACrCACCAATTGGTATACTCTTTGCAGCCTTCCTATTTGGC 

20 GGGGCTCCTGGTATGAATGCGGCGCAGGTACCATCTGAGCTTGTCAGCATTGTAACAGCGTCTATTATcriClllG 
TCAGTGTTCATTACCTTATCGAACGCTTTGTCAAACCGAAAAAACAAGTTAAAGGAGGTAAGTAA 

4194. 1 

ATGGGAGTGAAAAAGAAACTAAAGTTGACTAGTTTGCTAGGACTGTCTCTGTTAATCATGACAGCCTGTGCGACT 
25 AATGGGGTAACTAGCGATATTACAGCCGAATCGGCTGATTTTTGGAGTAAATTGGTTTACT^ 
TTCGCTTTTTATCGTTTGATATTAGTATCGGAGTGGGGATTATTCTCTTTACGGTCTTG 

CCAGTCTTTCAGGTGCAAATGGTGGCTTCTAGGAAAATGCAGGAAGCTCAGCCACGCATTAAGGCGCTTCGAGAA 
CAATATCCAGGTCGAGATATGGAAAGCAGAACCAAACTAGAGCAGGAAATGCGTAAAGTATTTAAAGAAATGGG 
TGTCAGACAGTCAGACTCTCTTTGGCCGATTTTGATTCAGATGCCGGTTATTTTGGCCCTGTTCCAAG^ 
30 GAGTTGACTTTTTAAAGACAGGTCATTTCTTATGGATTAACCTTGGTAGTGTGGATACAACCCTTG 
TTAGCAGCAGTATTCACCTTTTTAAGTA(nTGGTTGTCCAACAAAGCTTTGTCTGAGCGAAA 
CGATGATGTATGGGATTCCAGTCTTGATTTTTATCTTTGCAGTTTATGCGCCAGGTGGAGTCGCCCTATACTGGAC 
AGTGTCTAATGCTTATCAAGTCTTGCAAACCTATTTCTTGAATAATCCATTCAAGATTATCGCAGAGCGCGAGGCC 
GTAGTACAGGCACAAAAAGATTTGGAAAATAGAAAAAGAAAAGCCAAGAAAAAGGCTCAGAAAACGAAATAA 

35 

4194.4 

ATGGTTATCGATCCATTTGCTATCAACGAACTAGACTATTACTTAGTTTCACACrTCCACAGT 

CATACACAGCTGCAGCAATTCTCAATAATCCTAAGTTAGAGCATGTTAAGTTTATCGGTCCTTACCACTGTGGACG 
AATCTGGGAAGGATGGGGTGTTCCAAAAGAACGTATCATCGTTGTTAAACCAGGTGACACTATCGAATTAAAAGA 

40 TATGAAGATTCATGCAGTAGAATCATTTGACCGTACTTGCTTGGTAACTCTCCCAGTGAACGGTGCTGATGAGACA 
GGCGGTGAACTTGCTGGCTTGGCTGTTACAGATGAAGAAATGGCTCAAAAGGCTGTTAACTATATCTTTGAAACAC 
CAGGTGGAACCATCTATCATGGTGCAGATTCTCACTTCTCAAACTATTTTGCAAAACATGGTAAAGACrr^ 
TGATGTTGCTTTGAATAACTATGGTGAAAATCCGGTAGGTATCCAAGACAAAATGACATCTATCGACCTTCTTCGT 
ATGGCAGAAAATCTGCGTACCAAAGTCATTATCCCAGTTCACTATGATATCTGGTCTAACirCATGGCTTCTACT 

45 ATGAGATTCTAGAACTTTGGAAAATGCGAAAAGATCGCTTGCAATACGATTTCCATCCATTTATCTGGGAA 

CGGTAAGTACACnTATCCTCAAGATCAACACTTAGTAGAATACCATCATCCACGTGGTTTTGATGATTGTT^ 

CAAGACTCTAACATTCAATTTAAAGCTTTGCTATAA 



50 

4196.2 

ATGTTCCTTTCAGGCTGGTTGTCTAGTTTTGCTAATACTTATATCCATGATTTACTGGG 
TCCATTTTTAAATGCCTTTGAAAGTGCTATTGCGGCrCCTTTGGTAGAAGAACCCTTGAAATTA 

55 TTGTTTTGGCTTTGATTCCTGTGCGAAAATTAAAATCTTTGTTTTTACTTGG^ 

ATGATTAAGGATATTGGTTATATTCGTACGGATTTGCCAGAGGGCTTTGACTTTACTATTTCGCGAAT^^ 
GTATCATCTCAGGAATTGCCTCTCACTGGACTTTTTCAGGTCTAGCTGTAGTAGGTGTTTACTTGCTTTACAGAGC 
TATAAAGGACAGAAGGTTGGCAAGAAACAGGGCCTTATTTTTCTAGGTTTAGCCTTGGGAACTCACTTC^ 
ACTCTCCTTTTGTGGAGTTGGAAACAGAGTTGCCTTTAGCGATTCCAGTGGTTACGGCTATTGCTCTCTATGGT^^ 

60 TATCATGCTTATTGCTTTGTTGAGAAACACAATGAGTTGATGACCTAG 

4197.1 

ATGAAGGTGGAACCACGTTGCGACGTCCTTTCGAGGATGTCGCATTTTTTTATTAGGATACTAATTA 
AAGAATTAGTGGAGCGCAGTTGGGCAATCCGACAAGCTTATCACGAACTGGAAGTTAAGCATCATGATTCCAAGT 
65 GGACGGTAGAAGAAGACCTCTTGGCTTTATCTAATGATATTGGAAATTTCCAACGACTGGTGATGACAAAGCAAG 
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GACGCTACTATGATGAAACACCCTACACACTGGAACAAAAACTTTCAGAAAATATCTGGTGGCTATTAGAACTTT 
CTCAACGTTTGGATATAGACATTCTGACGGAAATGGAAAACTTCCTCTCTGATAAAGAAAAGCAATTGAACGTTA 

GGACTTGGAAGTAG 
5 4197.4 

ATGCTTGATTGGAAACAATTTTTTCTAGCCTATCTGCGCTCCCGTAGTCGTCTTTTTATCTATCT 
ATTTCTTGTCTTACTCTTTCAGTTTTTATTTGCCAGTCTAGGAATTTACT^ 

GTAACCATATTATTTTTCACTTGGGACATATTGGTGGAAACGCAGGTCTATCGCCAGGAACTTCTCTATGGAGAGA 
GGGAAGCCAAGTCrCCTTTGGAAATAGCTTTAGCAGAAAAATTAGAAGCGCGTGAGATGGAACTCTATCAGCAGA 
10 GGTCAAAAGCAGAAAGAAAACTGACGGATTTGCTGGATTACTATACCTTGTGGGTCCATCAGATAAAGACCCCCA 
TTGCAGCCAGTCAACTCTTAGTTGCAGAAGTGGTCGACCGCCAACTGAAGCAGCAGCTAGAACAGGAAATTTTCA 
AAATCGACTCCTATACCAACCTAGTTTTACAGTACCTGCGTTTAGAAAGTTTCCATGATGATTTGGTCr^ 
GGTTCAAATTGAGGACTTGGTCAAGGAAATAATTCGTAAATATGCTCTTTTCITTA^ 

AATCTACATGACCTTGATAAAGAAATCGTGACGGATAAAAAGTGGCTGCTAGTGGTTATTGAGCAAATCATCTCA 
15 AACAGTCTCAAGTACACCAAGGAAGGTGGTCTGGAGATTTATATGGATGACCAAGAGCTTTGTATCAAAGATACG 
GGAATCGGGATAAAAAACAGTGATGTCCTCCGAGTATTTGAACGTGGCTTTTCAGGATACAATGGCCGm 
AGCAGTCCTCTGGACTTGGCCTTTATCTATCTAAGAAAATTTCTGAAGAACTGGGGCACCAGA^ 
TGAGGTCGGAAAAGGAACGACAGTGCGGATTCAGTTTGCTCAAGTGAACTTAGTCCTTGAGTAA 

20 4211.2 

ATGGAACITAATACACACAATGCTGAAATCTTGCTCAGTGCAGCTAATAAGTCCCACTATCCGCA^ 
CCAGAGATTGCCCTAGCAGGGCGTTCAAATGTTGGTAAATCCAG<mTATCAACACTATGTTGAACCGTAAGAATC 
TCGCCCGTACATCAGGAAAACCTGGTAAAACCCAGCTCCTGAACTTTTTTAACATTGATGACAAGATGCGCTT^ 
GGATGTGCCTGGTTATGGCTATGCTCGTGTTTCTAAAAAGGAACGTGAAAAGTGGGGGTGCATGATTGAGGAGTA 
25 CTTAACGACTCGGGAAAATCTCCGTGCGGTTGTCAGTCTAGTTGACCTTCGTCATGACCCGTCAGCAGATGATGTG 
CAGATGTACGAATTTCTCAAGTATTATGAGATTCCAGTCATCATTGTGGCGACCAAGGCGGACAAGATTCCTCGTG 
GTAAATGGAACAAGCATGAATCAGCAATCAAAAAGAAATTAAACTTTGACCCGAGTGACGATTTCATCCTC^^ 
ATCTGTCAGTAAGGCAGGGATGGATGAGGCTTGGGATGCAATCTTAGAAAAATTGTGA 

30 4211.3 

ATGACAAAGAAACAACTTCACTTGGTGATTGTGACAGGGATGAGTGGCGCAGGGAAAACTGTAGCCATTCAGTCC 
TTCGAGGATCTAGGTTATTTCACCATTGATAATATGCCGCCAGCTCrCTTGCCTAAGTTTTT 
TAAGGAAGACAATCCTAAGTTGGCCTTGGTAGTGGATATGCGTAGCCGTTCTTTCTTTTCAGAGATTC^ 
TTGGATGAGTTGGAAAATCAAGATGGTTTGGATTTCAAAATCCrCTTTTTGGATGCGGCT 

35 CTCGTTACAAGGAAACCAGACGGAGTCACCCACTAGCAGCAGACGGTCGTATTTTAGATGGAATCAAGTTGGAAC 
GTGAACTCTTGGCACCTTTGAAAAATATGAGCCAAAATGTGGTGGATACGACTGAACTCACTCCACGTGAGcTG 
CAAAACCCITGCAGAGCAGTTTTCAGACCAAGAACAAGCCCAGTCITTCCGTATCGAAGTCATGTCTTTCGGAT^ 
AAGTATGGAATCCCGATTGATGCGGACTTGGTCTTTGATGTCCGTTTCTTGCCAAATCCCTATTATT ^ 
GAGAAACCAAACGGGTGTGGATGAACCTGTTTATGATTATGTCATGAACCATCCTGAGTCAGAAGACi i ri ATCAA 

40 CATTTATTGGCCTTGATTGAGCCGATTCTGCCAAGTTACCAAAAGGAAGGTAAGTCCGTTTTGACCAT^ 

GATGTACGGGTGGACAACACCGTAGTGTGGCATTTGCTAAACGCnTGGCGCAGGACTTATCCAAGAATTGGTCTGT 

TAATGAAGGGCATCGCGACAAAGACCGCAGAAAGGAAACGGTAAACCGTTCATGA 
4211.4 

45 ATGAGAAAACCAAAGATAACGGTGATTGGTGGAGGGACTGGAAGTCCCGTCATTCTAAAAAGTCTGCGGGAAAAA 
GATGTGGAAATCGCAGCTATCGTGACGGTGGCAGATGATGGTGGTTCTTCAGGTGAACTCCGAAAAAATATGCAA 
CAGTTGACACCGCCAGGTGATCTTCGTAATGTCCTTGTGGCCATGTCGGATATGCCTAAGTTTTATGAGAAGGTCT 
TTCAGTATCGGTTCTCTGAGGATGCCGGAGCCTTTGCTGGCCATCCATTGGGAAATCTCATCATTGCT 
AGAAATGCAGGGTTCAACCTATAATGCCATGCAGTTATTGAGCAAATTTTTCCATACAACAGGGAAAATT^ 

50 TCCAGTGACCATCCTTTGACCCTTCATGCAGTCTTTCAGGATGGGACAGAAGTGGCTGGAGAGAGTCATATTGTAG 
ACCATCGAGGCATAATTGACAATGTCTATGTGACCAATGCCCTAAACGATGATACGCCTCTGGCCAGCCGTCGAG 
TAGTGCAGACCATCCTTGAAAGTGACATGATTGTCCTAGGGCCAGGTTCCCTCTTTACCTCTATTTTGCCCAATAT 
CGTGATTAAGGAAATTGGGCGGGCTCTTTTGGAAACCAAGGCAGAAATTGCCTATGTCTGCAATATCATGACCCA 
ACGTGGGGAGACGGAACACTTTACAGATAGCGACCACGTGGAAGT(nTGCATCGTCACCTTGGTCGCCCTTTTATC 

55 GACACTGTCnTGGTGAATATTGAAAAAGTGCCTCAGGAATACATGAATTCCAACCGTTTTGATGAATACTTAGTGC 
AAGTGGAACACGATTTTGTAGGTCTTTGTAAGCAAGTTTCGCGCGTGATTTCATCTAACIT 
TGGCGGTGCCTTCCACGATGGAGATTTGATTGTGGACGAGTTGATGCGCATTATACAGGTGAAAAAATGA 

4213.1 

60 ATGAAAAATTTGATAAAGTTGCTAATAATTAGATTGATTGTTAACTTAGCAGACAGTGTATTTTATATAGTAGCAT 
TGTGGCACGTTAGCAATAATTATTCTTCGAGCATGTTCTTAGGAATATTTATTGCAGTAAATTATCTACCGGATTTG 
TTACrAATCTTTTTTGGACCAGTTATTGACAGAGTAAATCCGCAAAAAATTCTTATAATATCAATT^ 
AGCAGTGGCTGTAATATTTTTATTATTATTAAACCAAATATCATTTTGGGTGATAATGAGTCTAGTGTTTAT^ 
TAATGGCTAGCTCCATAAGTTACGTGATAGAAGATGTGTTGATTCCTCAAGTGGTAGAATATGATAAGATTGTATT 

65 TGCAAATTCTCTTTTTAGTATTTCGTATAAAGTATTAGATTCTATTTTTAATTCAT^ 
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CAGTAGGATTTATTTTATTGGTTAAGATAGATATAGGCATATTTTTACTTGCTCTATTTAT^ 

TTTAGAACTAGCAATGCGAATATAGAAAACTTCrCTTTCAAATATTACAAGAGAGAAGTGTTGCAAGGTACAAAG 

TTTATTTTAAATAATAAATTATTATTTAAAACCAGTATTTCTTTAACGCTT 

GTAGTTGTACCGATTTTTTCTATTCGATATTTTGATGGTCCGATTTTTTATGGTATT^^ 

GGTGGTATATTGGGAAATATGCTAGCGCCAATCGTAATAAAATATTTAAAATCGAATCAAATTGTTGGTGTATTTC 

TTTTTTTGAACGGCrCAAGTTGGTTAGTAGCAATTGTTATAAAAGACTATACTTTATCAC^ 

GTTTTATGTCTAAAGGAGTCTTCAATATTATTTTTAATTCGTTGTACCAACAAATACCTCCACATC^^ 

AGGGTAAATACTACCATTGATTCTATTATTTCTTTTGGAATGCCAATTGGTAGTTTAGTTGCAGGAACGCT^ 

TTTGAATATTGAATTAGTGTTAATTGCTATTAGCATACCTTATTTTTTGTTTTC^ 

ATTGAAAGAATTTAGTATATATTAG 

4213.2 

ATGATGTCTAACAAAAATAAGGAAATTCTGATTTTTGCGATTCTCTATACAGTCCTCTTTATG 

ATTGCTGGCTTCTTTAATGCCATCTGCCATTGCAAATTATCTTGTTTATGTAGTTTTAGCTCT 

GTTCAAGGATAGATTGATCCAACAATGGAAGGAGATTAGAAAGACTAAAAGAAAATTCITCTTTG^ 

AGGATGGCTCTTTCTCATTCTGATGACTGTTGTCTTTGAATTTGTATCAGAGATGTTGAAGCAGTT^ 

ATGGACAAGGTCTAAATCAGTCTAATATTCAAAGTACCTTTCAAGAACAACCACTACTGATAGCTGri rl IGCTTG 

TGTCATTGGACCTCTGGTAGAAGAATTATTTTTCCGTCAGGTCTTATTGCATTACTTGCAGGAACGG 

TACTAAGCATTATTCTGGTAGGACTTGTTTTTGCTCTGACTCATATGCACAGTTTGGCTCTATCAGAGTGGATTGGT 

GCAGTTGGTTACTTAGGTGGAGGCCTTGCCTTTTCTATTATTTATGTGAAAGAAAAAGAGAATATCT 

TACTTGTTCACATGTTAAGCAACAGCCTCTCCTTAATCATTTTAGCTATCAGTATAGTAAAATGA 



4224. 1 

TTGAAAAAGCCAATTATCGAATTCAAAAACGTCTCTAAAGTTTTTGAAGACAGCA^ 

ATCAACITTGAGTTGGAAGAAGGGAAATTCTACACCCrTCTAGGTGCATCTGGTTCGGGGA^ 

ACATTATTGCAGGTTTACTGGATGCGACGACAGGAGATATCATGCTAGACGGTGTTCGTATCAATGATATTCCAAC 

CAACAAGCGCGACGTACATACCGTCTTCCAATCCTATGCCTTGTTCCCACATATGAATGTGTTTGAAAATGTTGCC 

TTTCCACTTCGCTTGCGTAAAATTGATAAGAAAGAAATCGAGCAGCGTGTAGCGGAAGTTCTCAAGATGGTTCAGT 

TGGAAGGTTATGAAAAACGTTCCATCCGCAAACTTTCTGGAGGACAACGTCAGCGTGTGGCCATCGCCCGTGCTA 

TCATCAACCAACCCCGTGTGGTCTTGTTGGACGAGCCTTTATCAGCGCTGGACTTGAAATTGAGAACAGACATGCA 

GTACGAATTGCGTGAATTACAACAACGATTGGGCATTACCTTTGTCTTTGTCACTCACGATCAGGAAGAAGCrC^ 

GCCATGAGTGACTGGATTTTCGTTATGAATGATGGCGAGATTGTCCAGTCTGGAACCCCTGTGGACATCTACGATG 

AGCCAATCAACCACTTTGTTGCCACCTTTATCGGGGAGTCAAACATCTTGCCAGGTACCATGATTGAGGACTACTT 

GGTCGAATTTAACGGCAAACGCTTTGAAGCGGTTGATGGTGGGATGAAGCCAAATGAACCTGTTGAGGTCGTTAT 

TCGTCCAGAGGACITGCGCATTACCCTTCCTGAAGAAGGCAAGCTCCAAGTTAAGGTCGATACCCAGCTTTT^ 

GGAGTTCATTATGAAATTATCGCCTATGACGAACTTGGAAATGAATGGATGATCCACTCAACCCGTAAGGCTATCG 

TGGGTGAGGAAATCGGTCTGGACTTTGAACCAGAAGACATCCACATCATGCGTCTCAATGAAACCGAAGAAGAGT 

TCGATGCTCGTATTGAGGAGTACGTAGAAATCGAAGAGCAAGAAGCAGGTTTGATCAATGCAATCGAGGAGGAAA 

GAGATGAAGAAAACAAGCTCTAA 

4252.1 

ATGAAATCAATGAGAATCTTATTTTTGTTAGCTTTAATTCAAATCAGTTTGAGTAGCTG 
GCATCITGTCCTTTAAACAAAGTACAGCTTTTTTCATCGGAAGCATGGTT^ 

AAATTATCTTTATACCCGTAAGCAAGAAGTCCATAGTGTCCTAGCCAGTAAGAAGTCGGTGAAGCriTTTTACA 

ATGTTACTCTTAATTAATTTGTTAGGAGCTGTTCTTGTTTTGTCAGATAACT^ 

AGAATTAGTTGACTTTTTATTGCCATCCTTCITTTTCCTATTTGGGC^ 

ATACGTGCGCGA'rrri'Cin''GCTATGCTGGACAGAAAAAAGACAGTGTTGGTGACTATTTTAGCAAC^ 

TTAAGAAATCCAATGACCATTGTCTCACTTCTGATTTATATTGGACTGGGCTTGT^^ 

AAATTCGGTTAAGAAGGAAGTTTCCTTTTATGGTCATATTTTCCGAGATCTTGTAT^ 

TCTTTTAG 

4252.2 

ATGGTTAAAAAAATTATTGGAATGGTGCTAGCTTTACTTTCTGTAACTGTAGTAGGAGTAGGTGTTTTTG(^ 

TATTTATCAACAAGGGACAGAAACCTTAGCTAAAACCTATAAAAAAATCGGTGAAGAAACCAAGGTTATTGAAGC 

GACTGAACCTCTAACCATTCTGTTAATGGGAGTGGACACCGGAAATGTTGAACGAACTGAAACTTGGGTCGGTAG 

AAGTGATAGCATGATCTTGATGACAGTGAATCCTAAAACGAAAAAAACAACAATGATGAGTTTAGAGCGGGATAT 

TCTGACGCGCATTGAATCAGGGAATGGTCAGGCTCATGAAGCGAAACTGAACTCAGCATATGCAGATGGTGGAGC 

AGAGCTTGCTATAGAAACCATTCAAAAAATGATGAATATCCATATTGATCGCTATGTGATGGTCAATATGAGAGG 

ATTGCAAAAACTAGTGGATGCAGTAGGAGGTATTACAGTCAATAATATCCTAGGTTTCCCAATTTCTATCAGTGAC 

CAAGAAGAATTTAATACTATTTCTATCGGTGTTGGGGAGCAACATATTGGGGGAGAAGAAGCCCTAGTCTATGCA 

CGAATGCGTTACCAAGATCCTGAGGGGGATTATGGTCGTCAAAAACGTCAACGTGAAGTTATTCAAAAAGTCATG 

GAAAAAGCTCTCAGTTTAAATAGCATTGGTCATTATCAAGAGATTCTAAAAGCTTTGAGTGACAATATGCAGACC 
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AATATrGATTTGTCTGCAAAAAGTATCCCTAACTTGCTAGGCTATAAAGATTCATTTAAAACCATTGAAACTCAGC 
AGTTGCAGGGTGAAGGAGAGATACITCAAGGTGTTTCTTACCAGATTGTTTCGAGAGCACATATGTTGGAAATGCA 
AAATCTACTCCGACGTTCTTTGGGACAAGAAGAAGTTACTCAGCTTGAAACCAATGCGGTTTTAT^ 
TTTGGCAGAGCACCTGTTGGTGATGAAGATAATTAA 

5 

4256.2 

ATGAAAAAACAAGCCTATGTCATTATTGCTCTCACCTCCTTCCTATTTGTCTTTTTTTTCT^ 

AATACTTGATTTTGACTGGTCTATCTTTTTGCACGATGTCGAAAAAACAGAAAAATTTGTCTT^^ 

TCAGCATGTCCATGACCTGTCTCTTAGCCCTGTTTTGGCGAGGGATCGAAGAGCTTTCTCTAAGAAAAATGCAGGC 

10 TAATCTCAAGCGTTTATTAGCAGGGCAAGAAGTGGTTCAGGTTGCAGATCCAGATTTGGATGCCAGTTTCAAGTCC 
TTATCAGGTAAACTTAACCTTTTGACAGAGGCTCTTCAAAAAGCTGAAAATCAGAGCCTTGCTCAGGAAGAGGAA 
ATCATCGAGAAGGAACGGAAGCGAATTGCTCGGGATTTGCACGATACAGTCAGTCAGGAGTTGTTTGCGGCCCAC 
ATGATTTTATCGGGTATCAGTCAGCAGGCTTTGAAATTGGATAGAGAAAAGATGCAGACCCAGTTGCAGAGTGTC 
ACAGCTATTTTAGAAACAGCCCAGAAGGATTTGCGGGTTTTGCTCTTGCATTTGCGACCAG 

15 AGAGCrrGATAGAAGGGATTCAAATTCTTTTAAAAGAGCTTGAGGACAAGAGTGATC^ 

AGAATATGACGAAATTGCCTAAGAAAATCGAGGAGCATATCTTCCGTATCCTGCAAGAGTTGATTAGCAATAC^^ 
TCCGCCATGCCCAGGCATCTTGCCTAGATGTCTACCTCTATCAGACAGATGTTGAATTGCAACTGAAGGTGGTGGA 
CAATGGGATTGGTTTCCAGTTAGGGAGCTTAGACGACTTGAGTTATGGACTGCGAAATATCAAGGAGCGGGT^^ 
AGATATGGCTGGAACAGTTCAACTCTTGACAGCTCCCAAGCAAGGGCTGGCGGTTGATATCCGTATTCCCCTGT^ 

20 GATAAGGAATGA 

4263.1 

ATGATTGTTTCCATTATTTCTCAAGGATTTGTCTGGGCTATTCTAGGTCTGGGAATCTTTATGACATT^ 

AAACTTTCCAGATATGACGACAGAAGGTTCCTTCCCTCTTGGGGGAGCTGTTGCTGTCACTTTGATAACCAAAGGC 
25 GTGAACCCATTTTTAGCGACACTTGTTGCTGTAGGAGCAGGTTGTTTGGCTGGAATGGCAGCAGGCCTTCT^ 
CAAAAGGGAAGATCCCAACCTTGCTCTCAGGGATTTTGGTGATGACTTCTTGTCACTCAATCATGCTC^ 
GGGACGTGCGAATTTAGGCCTGCTTGGAACCAAGCAAATTCAGGATGTTTTGCCTTTTGATTCGGAT^ 
CTCTTGACAGGTCTCATCTTTGTGAGTATTGTTATTGCTCTCATGCTCTTTTTCTTGGAC^ 

CTATATTGCTACAGGGGATAATCCTGATATGGCTAGAAGTTTCGGGATTCATACTGGACGCATGGAGCTCATGGGC 
30 TTGGTCTTATCAAATGGTGTGATTGCCCTTGCAGGTGCCCTCATTGCTCAGCAAGAAGGTTATGCCGATGTGTCTC 
GAGGGATCGGGGTTATCGTTGTGGGGCTTGCAAGTTTGATTATTGGAGAAGTTATTTTCAAGAGTTTGAGC^ 
AGAGCGTTTGGTTACTATCGTTGTAGGTTCTATCGCTTATCAATTTTTAGTGTGGGCAGTTATCGCAC^ 
ATACAAGTTACCITCGTTTATACAGTGCCTTGATTTTAGCAGTCTGCCTCATGATTCCAACATTTAAGCA^ 
CTTGAAAGGAGCCAAGTTAAGCAAATGA 

35 

4346. 1 

ATGAAAAAAATGAAAGTTTGGTCTACTGTACTTGCAACGGGAGTTGCrCTTACTACACTTGC^ 
GTTCAAATTCTACGACTGCTTCTTCATCTGAAGAAAAAGCTGATAAAAGTC AAGA ATTAGTTATCTATT 
AGTCTCAAATGGTCGTGGTGATTGGTTAACTGCTAAAGCAAAAGAAGCTGGTTTTAATATAAAAATGGTTGATATC 
40 GCTGGCGCTCAATTAGCAGACCGTGTTATTGCTGAGAAGAATAATGCAGTTGCAGATATGGTATTTGGAATTGGTG 
CTGTTGATTCAAATAAAATTAGAGATCAAAAATTACTAGTACAGTACAAGCCTAAATGGTTAGATAAAATTGATC 
AATCTTTATCAGATAAAGATAATTATTATAATCCTGTGATTGTTCAACCATTAGTTTTAAT^ 
AAAGAAATGCCTAAAGATTGGACTGAATTAGGTAGTAAGTATAAAGGTAAATATTCAATTTCTGG 
GGTACAGGACGGGCAATTCTAGCAAGTATCTTAGTTCGATACCTTGATGATAAAGGTGAATTAGGTGTTTCCGA^ 

45 AAGGTTGGGAAGTAGCAAAAGAATATTTGAAAAATGCATACACTCTTCAAAAGGGAGA^ 

TGTTAGACAAAGAAGATCCAATACAATATGGAATGATGTGGGGTTCTGGTGCATTAGTTGGACAAAAAGAACAAA 

ATGTTGTTTTCAAAGTTATGACTCCTGAGATTGGTGTACCATTTGTAACTGAACAAACTATGGT^^ 
TAAAAAACAAGCGTTAGCTAAAGAATTTATTGATTGGTTTGGTCAATCAGAAATTCAAGTAGA^^ 
CTTTGGATCTATTCCTGCAAATAAAGATGCCCTCAAAGATCTACCTGAAGATACGAAGAAATTTGTTGATCAAG 
50 AAACCACAAAATATTGACTGGGAAGCTGTTGGAAAGCATTTGGATGAATGGGTAGAAAAAGCTGAATTAGAATAC 

GTACAATAA 
4346.2 

ATGATTAAATTTGATAATATTCAAATTAAATATGGTGATTTTGTTGCAATTGATAATCTGAATTTAGATATAC 
55 AAGGGGAATTTTTTACATTTCTTGGGCCTTCAGGATGTGGTAAATCAACTACTTTGAGAGCAT^ 

AGATCCATCATCAGGAAGTATTGAAGTTAATGGAACAGATGTCACTCATTTGGAACCTGAAAAGCGTGGAATTGG 
TATTGTATTTCAATCTTATGCGCTATTTCCAACTATGACTGTTTTTGATAATATTGCATTTGG 

AGGTAGCTCCAGATGTTATTAAAGCTAAAGTATCAGCAGTGGCAGCAAAAATTAAGATCTCTGATCAACAGTTAC 
AGCGTAATGTATCAGAATTATCTGGGGGTCAACAACAAAGGGTAGCATTGGCTCGTGCTCTGGTTCTTGAACCTAA 

60 AATTCITTGTCTAGATGAACCATTGTCAAACCTTGACGCAAAATTACGTGTAGATTTGAGAAAAGAGTTGA^^ 

CTTCAAAAAGAGTTAGGTATTACTACTTTATATGTTACTCATGATCAAGAGGAAGCCTTGACTTTATCTGATAGAA 
TTGCAGTCTTTAACAATGGATACATCGAACAGGTCGGTACACCAGTAGAGATTTATCATAATTCTCAAACTGAATT 
TGTATGTGATTTTATTGGAGATATTAATGTTTTGACCGATGAAACAGTCCACGAAGTATTATTGAAAAATACAAGC 
GTTTTCTTAGAGGATAAAAAAGGATACATTCGATTAGAGAAAGTTCGATTCAATCGTGAAACTGAACAAGATTTTA 

65 TTCTAAAAGGGACAATTATTGATGTTGAGTTTTCTGGAGTTACAATTCACTATACAATAAAAGTTTCTGAAAGTCA 
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GATTCTTAATGTAACAAGTATTGATAGTCAGGCTGCTATTAGATCTGTCGGAGAAAGTGTGGAATTATTTATCACA 
CCATCAGACGTTCTGCAATTTTAA 



4346.3 

5 ATGCGTCATAAATTAAATTTAAAAGATTGGCTTATTCGTTTAGGGTTAATCTGGTTCTTAGTAA 
TCCAAACITTGATCTAGTAGTGAATGTATTTGTAAAAGGAGGAGAATTTTCCCTTGATGCrGTA 
AAATCrCAGAGGGCACTTCAGAGTATTATGAACAGTTTTAAGTTAGCATTTTCACTCATTATTACAGT^ 
TAGGTATTCITTGTGTTCTATTTACAGAGTACTTTGATATTAAAGGTGCTAAAATTTT/^^ 
TCITTAATTTATGGAGGAGTGGTTTTAGCGACTGGATATAAATTTGTCTATGGTCCTTATGGAT^ 
10 TTTACAAAATGTTATCCCTTCTTTAGACCCTAACTGGTTTATTGGGTATGGTGCAGTCTTATTCATTATGACAT^ 
CAGGAACTGCTAATCATACATTGTTTTTAACAAATACAATTCGAAGCGTTGACTATCACACTATTGAGGCTGCTCG 

AAATATGGGAGCAAAACCATTTACTGTTTTCCGAAAAGTAGTGTTACCAACCITAATTCCAACT 

ACTATTATGGTTTTTCTTAGTGGTTTATCTGCAGTAGCAGCACCCATGATTGTTGGTGGTAAAGAAT^ 

AAATCCAATGATTATTACATTTGCAGGGATGGGGAATTCTCGTGATTTAGCTGCCCTACITGCAATT^ 

15 ATTGCAACTACAATTTTGCITACTATCATGAATAAGATAGAAAAAGGTGGAAATTATAT^ 
AAGCGCCrCITAAAAAACAAAAAATTGCGTCTAAGCCnTGGAATATCATTGCTCAC^ 
CACAGTTTTCATGCTTCCACTAATTTTTATAGTATTATACTCATTTACAGATCCAGT^ 
TAACATTATCAAACITTACnTTAGAAAATTATCGCITATTCT^ 
AGOTTATTTATTCrATTATTGCrGCGACAACAGCAACAATTCTCGCAGTTGTAT^ 

20 TAAATCTCGTTTTGATTTCTTATTTGAATATGGTGCTCTACTTCCTTGG^ 
GTTTATTATTTACTTTTAATCAGCCACAATTTCTTGTCTTGAATCAGAT^^ 

ATTGCATATATAGTTGTAAAAATCCCATTTTCTTATAGAATGGTACGTGCTATTTTATTTAGTGT^ 
GGAAGATGCAGCAAGAAGTATGGGTGCTTCACCTTTTTATACTATGATGAAGGTTATCATTCCATTTATT^ 

GTTGTTCTCTCTGTTATTGCnTTAAACTTTAACTCTTTATTAACTGACTT 
25 CTAGCTCAACCATTAGGTATTACGATTCGATCTGCAGGTGATGAAACAGCAACATCTAATGCACAAGCTCTGGTAT 

TTGTTTATACAATTGTTCTGATGATTATTTCTGGAACGGTATTATACTTCACACAAAGACCGGGGCGTAAAGTAAG 
GAAATAA 
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Table 2 

MEELVTLDCLFIDRTKIEANANKYSFVWKKTTEKFSAKLQEQIQVYFQEEITPLLIKYAMFDKKQKRGYKESAKNLANW 
HYNDKEDSYTHPDGWYYRFHHTKYQKTQTDFQQEIKVYYADEPESAPQKGLYMNERYQNLKAKECQALLSPQGRQIF 
5 AQRKIDVEPVFGQIKASLGYKRCNLRGKRQVRIDMGLVLMANNLLKYSKMKZ 

MGKGHWNRKRVYSIRKFAVGACSVMIGTCAVLLGGNIAGESVVYADETLITHTAEKPKEEKMIVEEKADKALETKNIV 

ERTEQSEPSSTEAIASEKKEDEAVTPKEEKVSAKPEEKAPRIESQASNQEKPLKEDAKAVTNEEVNQMIEDRKVDFNQN 

WYFKLNANSKEAIKPDADVSTWKKLDLPYDWSIFNDFDHESPAQNEGGQLNGGEAWYRKTFKLDEKDLKKNVRLTF 

10 DGVYMDSQVYVNGQLVGHYPNGYNQFSYDITKYLQKDGRENVIAVHAVNKQPSSRWYSGSGIYRDVTLQVTDKVHV 
EKNGTTILTPKLEEQQHGKVETHVTSKIVNTDDKDHELVAEYQIVERGGHAVTGLVRTASRTLKAHESTSLDAILEVER 
PKLWTVLNDKPALYELITRVYRDGQLVDAKKDLFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHGALGAEENYBCAE 
YRRLKQMKEMGVNSIRTTHNPASEQTLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFFEKDATHPEARKGEKWSDFD 
LRTMVERGKNNPAIFMWSIGNEIGEANGDAHSLATVKRLVKVIKDVDKTRYVTMGADKFRFGNGSGGHEKIADELDA 

15 VGFNYSEDNYKALRAKHPKWLIYGSETSSATRTRGSYYRPERELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFD 
RDNAGYAGQFIWTGTDYIGEPTPWHNQNQTPVKSSYFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWNWENK 
ELASKVADSEGKIPVRAYSNASSVELFLNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQPGTLEAIARDES 
GKEIARDKITTAGKPAAVRLIKEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRFQLHGQGQLVGVDNGEQASRER 
YKAQADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFTGKKEGQEKTVLGTEVPKVQTIIGEAPEMPT 

20 TVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREVEARVEVLA.LKSELPVVKR1APNTDLNSVDKSVSYVLIDGS 
VEEYEVDKWEIAEEDKAKLAIPGSRIQATGYLEGQPIHATLVVEEGNPAAPAVPTVTVGGEAVTGLTSQKPMQYRTLA 
YGAKLPEVTASAKNAAVTVLQASAANGMRASIFIQPKDGGPLQTYAIQFLEEAPKIAHLSLQVEKADSLKEDQTVKLSV 
RAHYQDGTQAVLPADKVTFSTSGEGEVAIRKGMLELHKPGAVTLNAEYEGAKDQVELTIQANTEKKIAQSIRPVNVVT 
DLHQEPSLPATVTVEYDKGFPKTHKVTWQAIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTPIAEAP 

25 QLPESVRTYDSNGHVSSAKVAWDAIRPEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANISDQWTGSELPLAF 
ASDSNPSDPVSNVNDKLISYNNQPANRWTNWNRTNPEASVGVLFGDSGILSKRSVDNLSVGFHEDHGVGVPKSYVIEY 
YVGKTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMNHFSFDKVETYAVRIRMVKADNKRGTSITEV 
QIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGKVPAVTASVSNNGLATVVPSVREGEPVRVIAKAENGD 
ILGEYRLHFTKDKSLLSHKPVAAVKQARLLQVGQALELPTKVPVYFTGKDGYETKDLTVEWEEVPAENLTKAGQFTVR 

30 GRVLGSNLVAEITVRVTDKLGETLSDNPNYDENSNQAFASATNDIDKNSHDRVDYLNDGDHSENRRWTNWSPTPSSNP 
EVSAGVIFRENGKIVERTVTQGKVQFFADSGTDAPSKLVLERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYR 
ADKDIAAGDEINVTFKAIKAKAMRWRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQIT 
YKGQRPKVSVEENNQVASTVVDSGEDSFPVLVRLVSESGKQVKEYRIHLTKEKPVSEKTVAAVQEDLPKIEFVEKDLAY 
KTVEKKDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKLREVVEVPTDRIVLVGTKPVAQEAKKPQVSEKADTKPID 

35 SSEASQTNKAQLPSTGSAASQAAVAAGLTLLGLSAGLVVTKGKKEDZ 

MKIMKKKYWTLAILFFCLFNNSVTAQEIPKNLDGNITHTQTSESFSESDEKQVDYSNKNQEEVDQNKFRIQIDKTELFVT 
TDKHLEKNCCKLELEPQINNDIVNSESNNLLGEDNLDNKIKENVSHLDNRGGNIEHDKDNLESSIVRKYEWDIDKVTGG 
GESYKLYSKSNSKVSIAILDSGVDLQNTGLLKNLSNHSKNYVPNKGYLGKEEGEEGIISDIQDRLGHGTAVVAQIVGDDN 
40 INGVNPHVNINVYRIFGKSSASPDWIVKAIFDAVDDGNDIINLSTGQYLMIDGEYEDGTNDFETFLKYKKAIDYANQKGV 
IIVAALGNDSLNVSNQSDLLKLISSRKKVRKPGLVVDVPSYFSSTISVGGIDRLGNLSDFSNKGDSDAIYAPAGSTLSLSEL 
GLNNFINAEKYKEDWIFSATLGGYTYLYGNSFAAPKVSGAIAMIIDKYKLKDQPYNYMFVKKFWKKHYQZ 

MKKTWKVFLTLVTALVAVVLVACGQGTASKDNKEAELKKVDFILDWTPNTNHTGLYVAKEKGYFKEAGVDVDLKLP 
45 PEESSSDLVINGKAPFAVYFQDYMAKKLEKGAGITAVAArVEHNTSGIISRKSDNVSSPKDLVGKKYGTWNDPTELAML 
KTLVESQGGDFEKVEKVPNNDSNSITPIANGVFDTAWIYYGWDGILAKSQGVDANFMYLKDYVKEFDYYSPVIIANND 
YLKDNKEEARKVIQAIKKGYQYAMEHPEEAADILIKNAPELKEKRDFVIESQKYLSKEYASDKEKWGQFDAARWNAFY 

KWDKENGILKEDLTDKGFTNEFVKZ 

50 MKRTWRNSFVTNLNTPFMIGNIEIPNRTVLAPMAGVTNSAFRTIAKELGAGLVVMEMVSDKGIQYNNEKTLHMLHIDE 
GENPVSIQLFGSDEDSLARAAEFIQENTKTDIVDINMGCPVNKIVKNEAGAMWLKDPDKIYSIINKVQSVLDIPLTVKMR 
TGWADPSLAVENALAAEAAGVSALAMHGRTREQMYTGHADLETLYKVAQALTKIPFIANGDIRTVQEAKQRIEEVGA 
DAVMIGRAAMGNPYLFNQINHYFETGEILPDLTFEDKMKIAYEHLKRLINLKGENVAVREFRGLAPHYLRGTSGAAKL 

RGAISQASTLAEIETLLQLEKAZ 

MIKNPKLLTKSFLRSFAILGGVGLVIHIAIYLTFPFYYIQLEGEKFNESARVFTEYLKTKTSDEIPSLLQSYSKSLTISAHLK 
RDIVDKRLPLVHDLDIKDGKLSNYIVMLDMSVSTADGKQVTVQFVHGVDVYKEAKNILLLYLPYTFLVTIAFSFVFSYF 
YTKRLLNPLFYISEVTSKMQDLDDNIRFDESRKDEVGEVGKQINGMYEHLLKVIYELESRNEQIVKLQNQKVSFVRGAS 
HELKTPLASLRIILENMQHNIGDYKDHPKYIAKSINKIDQMSHLLEEVLESSKFQEWTECRETTVKPVLVDILSRYQELAH 
60 SIGVTIENQLTDATRVVMSLRALDKVLTNLISNAIKYSDKNGRVIISEQDGYLSIKNTCAPLSDQELEHLFDIFYHSQIVTD 
KDESSGLGLYIVNNILESYQMDYSFLPYEHGMEFKISLZ 

MYLGDLMEKAECGQFSILSFLLQESQTTVKAVMEETGFSKATLTKYVTLLNDKALDSGLELAIHSEDENLRLSIGAATK 
GRDIRSLFLESAVKYQILVYLLYHQQFLAHQLAQELVISEATLGRHLAGLNQILSEFDLSIQNGRWRGPEHQIHYFYFCL 
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FRKVWSSQEWEGHMQKPERKQEIANLEEICGASLSAGQKLDLVLWAHISQQRLRVNACQFQVIEEKMRGYFDNIFYLR 
LLRKVPSFFAGQHIPLGVEDGEMMIFFSFLLSHRILPLHTMEYILGFGGQLADLLTQLIQEMKKEELLGDYTEDHVTYEL 
SQLCAQVYLYKGYILQDRYKYQLENRHPYLLMEHDFKETAEEIFHALPAFQQGTDLDKKILWEWLQLIEYMAENGGQ 
HMRIGLDLTSGFLVFSRMAAILKRYLEYNRFITIEAYDPSRHYDLLVTNNPIHKKEQTPVYYLKNDLDMEDLVAIRQLLF 

5 TZ 

MEFSKXTRELSIKKMQERTLDLUIGGGITGAGVALQAAASGLETGLIEMQDFAEGTSSRSTKLVHGGLRYLKQFDVEV 
VSDTVSERAVVQQIAPHIPKSDPMLLPVYDEDGATFSLFRLKVAMDLYDLLAGVSNTPAANKVLSKDQVLERQPNLKK 
EGLVGGGVYLDFRNNDARLVIENIKRANQDGALIANHVKAEGFLFDESGKITGVVARDLLTDQVFEIKARLVINTTGPW 
10 SDKVRNLSNKGTQFSQMRPTKGVHLVVDSSKIKVSQPVYFDTGLGDGRMVFVLPRENKTYFGTTDTDYTGDLEHPKVT 
QEDVDYLLGIVNNRFPESNITIDDIESSWAGLRPLIAGNSASDYNGGNNGTISDESFDNLIATVESYLSKEKTREDVESAV 
SKLESSTSEKHLDPSAVSRGSSLDRDDNGLLTLAGGKITDYRKMAEGAMERVVDILKAEFDRSFKLINSKTYPVSGGELN 
PANVDSEIEAFAQLGVSRGLDSKEAHYLANLYGSNAPKVFALAHSLEQAPGLSLADTLSLHYAMRNELTLSPVDFLLRR 
TNHMLFMRDSLDSIVEPILDEMGRFYDWTEEEKATYRADVEAALANNDLAELKNZ 

15 

MMNELFGEFLGTLILILLGNGVVAGVVLPKTKSNSSGWIVITMGWGIAVAVAVFVSGKLSPAYLNPAVTIGVALKGGLP 

WASVLPYILAQFAGAMLGQILVWLQFKPHYEAEENAGNILATFSTGPAIKDTVSNLISEILGTFVLVLTIFALGLYDFQA 

GIGTFAVGTLIVGIGLSLGGTTGYALNPARDLGPRIMHSILPIPNKGDGDWSYAWIPVVGPVIGAALAVLVFSLFZ 

20 MTKKKIEmSVIHREKILWLKWYFMRDKEQPKYSVLERKMFDAAKNQDMLAYQKYATIKQITDIRVQTSEADILEAVKE 
VYVYNHMNVIGACQRILFISQSPAYDKLNKWFNIYSDLYFSVVPLPKMGVYHEMVGIZ 

MKNSNEAEMKLLYTDIRTSLTEILTREAEELVAAGKRVFYIAPNSLSFEKERAVLEYLSQQASFSITVTRFAQMARYLVL 
NDLPAKTTLDDIGLGLAFYKCLAELDPKDLRVYGAIKQDPQLIQQLIELYHEMTKSQMSFLDLENLTDEDKRADLLLIF 

25 EKVTAYLNQGQLAQESQLSHLIEAIENDKVSSDFNQIALVIDGFTRFSAEEERVVDLLHGKGVEIVIGAYASKKAYTSPFS 
EGNLYQASVKFLHHLASKYQTPAQDCSQTHEKMDSFDKASRLLESSYDFSELALDVDEKDRENLQIWSCLTQKEELEL 
VARSIRQKLHENSDLSYKHFRILLGDVASYQLSLKTIFDQYQIPFYLGRSEAMAHHPLTQFVESILALKRYRFRQEDLINL 
LRTDLYTDLSQSDIDAFEQYIRYLGINGLPAFQQTFTKSHHGKFNLERLNVLRLRILAPLETLFASRKQKAEKLLQKWSV 
FLKEGAVTKQLQDLTTTLEAVEQERQAEVWKAFCHVLEQFATVFAGSQVSLEDFLALLHSGMSLSQYRTIPATVDTVL 

30 VQSYDLIAPLTADFVYAIGLTQDNLPKISQNTSLLTDEERQNLNQATEEGVQLLIASSENLKKNRYTMLSLVNSARKQLF 
LSAPSLFNESESKESAYLQELIHFGFRRREKRMNHKGLSKEDMGSYHSLLSSLVAYHQQGEMSDTEQDLTFVKVLSRVI 
GKKLDQQGLENPAIPTSPSSKTLAKDTLQALYPAKQEFYLSTSGLTEFYRNEYSYFLRYVLGLQEELRLHPDARSHGNFL 
HRIFERALQLPNEDSFDQRLEQAIQETSQEREFEAIYQESLEAQFTKEVLLDVARTTGHILRHNPAIETIKEEANFGGKDQ 
AFIQLDNGRSVFVRGKVDRIDRLKANGAIGVVDYKSSLTQFQFPHFFNGLNSQLPTYLAALKREGEQNFFGAMYLEMA 

35 EPVQSLMAVKSLAGAVVEASKSMKYQGLFLEKESSYLGEFYNKNKANQLTDEEFQLLLDYNAYLYKKAAEKILAGRF 
AINPYTENGRSIAPYVQQHQAITGFEANYHLGQARFLEKLDLADGKRLVGEKLKQAWLEKIREELNRZ 



MKLIPFLSEEEIQKLQEAEANSSKEQKKTAEQIEAIYTSAQNILVSASAGSGKTFVMAERILDQLARGVEISQLFISTFTVK 
40 AATELKERLEKKISKKIQETDDVDLKQHLGRQLADLPNAAIGTMDSFTQKFLGKHGYLLDIAPNFRILQNQSEQLILENE 
VFHEVFEAHYQGKQKETFSHLLKNFAGRGKDERGLRQQVYKIYDFLQSTSNPQKWLSESFLKGFEKADFTSEKEKLTE 
QIKQALWDLESFFRYHLDNDAKEFAKAAYLENVQLILDEIGSLNQESDSQAYQAVLARVVAISKEKNGRALTNASRKA 
DLKPLADAYNEERKTQFAKLGQLSDQIAILDYQERYHGDTWKLAKTFQSFMSDFVEAYRQRKRQENAFEFADISHYTIE 
ILENFPQVRESYQERFHEVMVDEYQDTNHIQERMLELLSNGHNRFMVGDIKQSIYRFRQADPQIFNEKFQRYAQNPQEG 
45 RLIILKENFRSSSEVLSATNDVFERLMDQEVGEINYDNKHQLVFANTKLTPNPDNKAAFLLYDKDDTGEEEESQTETKL 
TGEMRLVIKEILKLHQEKGVAFKEIALLTSSRSRNDQILLALSEYGIPVKTDGEQNNYLQSLEVQVMLDTLRVIHNPLQD 
YALVALMKSPMFGFDEDELARLSLQKAEDKVHENLYEKLVNAQKMASSQKGLIHTALAEKLKQFMDILASWRLYAKT 
HSLYDLIWKIYNDRFYYDYVGALPNGPARQANLYALALRADQFEKSNFKGLSRFIRMIDQVLEAQHDLASVAVAPPKD 
AVELMTIHKSKGLEFPYVFILNMDQDFNKQDSMSEVILSRQNGLGVKYIAKMETGAVEDHYPKTIKLSIPSLTYRQNEEE 
50 LQLASYSEQMRLLYVAMTRAEKKLYLVGKGSREKLESKEYPAAKNGKLNSNTRLQARNFQDWLWAISKVFTKDKLNF 
SYRFIGEDQLTREAIGELETKSPLQDSSQADNRQSDTIKEALEMLFCEVEVYNTLHRAAIELPSVQTPSQIKKFYEPVMDM 
EGVEIAGQGQSVGBCKISFDLPDFSTKEKVTGAEIGSATHELMQRIDLSQQLTLASLTETLKQVQTSQAVRDKINLDKILAF 
FDTVLGQEILANTDHLYREQPFSMLKRDQKSQEDFVVRGILDGYLLYENKIVLFDYKTDRYDEPSQLVDRYRGQLALY 

EEALSRAYSIENIEKYLILLGKDEVQVVKVZ 

55 

MELARHAESLGVDAIATIPPIYFRLPEYSVAKYWNDISSAAPNTDYVIYNIPQLAGVALTPSLYTEMLKNPRVIGVKNSS 

MPVQDIQTFVSLGGEDHIVFNGPDEQFLGGRLMGARAGIGGTYGAMPELFLKLNQLIADKDLETARELQYAINAIIGKL 

TSAHGNMYGVIKEVLKINEGLNIGSVRSPLTPVTEEDRPVVEAAAALIRETKERFLZ 

60 

MYKTKCLREKLVLFLKIFFPILIYQFANYSASFVDTAMTGQYNTMDLAGVSMATSIWNPFFTFLTGIVSALVPIIGHHLG 
RGKKEEVASDFYQFIYLALGLSVVLLGMVLFLAPIILNHIGLEAAVAAVAVRYLWFLSIGIIPLLLFSVIRSLLDSLGLTKL 
SMYLMLLLLPLNSGFNYLLIYGAFGVPELGGAGAGLGTSLAYWVLLGISVLVLFKQEKLKALHLEKRIPLNMDKIKEGV 
RLGLPIGGTVFAEVAIFSVVGLIMAKFSPLIIASHQSAMNFSSLMYAFPMSISSAMAIVVSYEVGAKRFDDAKTYIGLGRW 
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TALIFAAFTLTFLYIFRGNVASLYGNDPKFIDLTVRFLTYSLFFQLADTFAAPLQGILRGYKDTVIPFYLGLLGYWGVAIP 
VYAIZ 

MSTLAKIEALLFVAGEDGIRVRQLAELLSLPPTGIQQSLGKLAQKYEKDPDSSLALIETSGAYRLVTKPQFAEILKEYSKA 
5 PINQSI^RAALETLSIIAYKQPITRIEIDAIRGVNSSGALAKLQAFDLIKEDGKKEVLGRPNLYVTTDYFLDYMGINHLEEL 

PVIDELEIQAQESQLFGERIEEDENQZ 

MDTMISRFFRHLFEALKSLKRNGWMTVAAVSSVMITLTLVAIFASVIFNTAKLATDIENNVRVVVYIRKDVEDNSQTIE 
KEGQTVTNNDYHKVYDSLKNMSTVKSVTFSSKEEQYEKLTEIMGDNWKIFEGDANPLYDAYIVEANTPNDVKTIAEDA 
10 KKIEGVSEVQDGGANTERLFKLASFIRVWGLGIAALLIFIAVFLISNTIRITIISRSREIQIMRLVGAKNSYIRGPFLLEGAFIG 
LLGAIAPSVLVFIVYQIVYQSVNKSLVGQNLSMISPDLFSPLMIALLFVIGVFIGSLGSGISMRRFLKIZ 

MKKVRFIFLALLFFLASPEGAMASDGTWQGKQYLKEDGSQAANEWVFDTHYQSWFYIKADANYAENEWLKQGDDYF 
YLKSGGYMAKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIKADGQHAEKEW 

15 LQIKGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLKSGGY 
MAANEWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYD 
SHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYDSHSQAWYYFKSGGYMAKNETVDGYQLGSDGK 
WLGGKTTNENAAYYQVVPVTANVYDSDGEKLSYISQGSVVWLDKDRKSDDKRLAITISGLSGYMKTEDLQALDASKD 
FIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAEELDKVFSLLNI 

20 NNSLLENKGATFKEAEEHYHINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFDDVDKGILGATKWI 
KENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKDZ 

MKKVLQKYWAWAFVVIPLLLQAIFFYVPMFQGAFYSFTNWTGLTYNYKFVGLNNFKLLFMDPKFMNAIGFTAIIAIAM 
VVGEIALGIFIARVLNSKIKGQTFFRAWFFFPAVLSGLTVALIFKQVFNYGLPAIGNALHIEFFQTSLLGTKWGAIFAAVF 
25 VLLWQGVAMPIIIFLAGLQSIPTEITEAARIDGATSKQVFWNIELPYLLPSVSMVFILALKGGLTAFDQVFAMTGGGPNN 
ATTSLGLLVYNYAFKNNQFGYANAIAVILFFLIVVISIIQLRVSKKFEIZ 

MMKQDERKAUGKYILLILGSVLILVPLLATLFSSFKPTKDIVDNFFGFPTNFTWDNFSRLLADGIGGYYWNSVVrrVLSL 
LAVMIFIPMAAYSIARNMSKRKAFTIMYTLLILGIFVPFQVIMIPITVMMSKLGLANTFGLILLYLTYAIPQTLFLYVGYIKI 
30 SIPESLDEAAEIDGANQFTTYFRIIFPMMKPMHATTMIINALWFWNDFMLPLLVLNRDSKMWTLPLFQYNYAGQYFND 
YGPSFASYVVGIISITIVYLFFQRHIISGMSNGAVKZ 

MKSILQKMGEHPMLLLFLSYSTVISILAQNWMGLVASVGMFLFTIFFLHYQSILSHKFFRLILQFVLFGSVLSAAFASLEH 
FQIVKKFNYAFLSPNMQVWHQNRAEVTFFNPNYYGIICCFCIMIAFYLFTTTKLNWLKVFCVIAGFVNLFGLNFTQNRT 
35 AFPAIIAGAIIYLFTTIKNWKAFWLSIGVFAIGLSFLFSSDLGVRMGTLDSSMEERISIWDAGMALFKQNPFWGEGPLTYM 
NSYPRIHAPYHEHAHSLYIDTILSYGIVGTILLVLSSVAPVRLMMDMSQESGKRPIIGLYLSFLTVVAVHGIFDLALFWIQS 

GFIFLLVMCSIPLEHRMLVSDMTDZ 

MSKMDVQKIIAPMMKFVNMRGIIALKDGMLAILPLTVVGSLFLIMGQLPFEGLNKSIASVFGANWTEPFMQVYSGTFAI 
40 MGLISCFSIAYSYAKNSGVEALPAGVLSVSAFFILLRSSYIPKQGEAIGDAISKVWFGGQGIIGAIIIGLVVGSIYTFFIKRKIV 
IKMPEQVPQAIAKQFEAMIPAFVIFLSSMIVYILAKSLTNGGTFIEMIYSAIQVPLQGLTGSLYGAIGIAFFISFLWWFGVH 
GQSVVNGVVTALLLSNLDANKAMLASANLSLENGAHIVTQQFLDSFLILSGSGITFGLVVAMLFAAKSKQYQALGKVA 
AFPAIFNVNEPVVFGFPIVMNPVMFVPFILVPVLAAVIVYGAIATGFMQPFSGVTLPWSTPAILSGFLVGGWQGVITQLVI 

LAMSTLVYFPFFKVQDRLAYQNEIKQSZ 

45 

MKKKDLVDQLVSEIETGKVRTLGIYGHGASGKSTFAQELYQALDSTTVNLLETDPYITSGRHLVVPICDAPNQKVTASLP 
VAHELESLQRDILACRRVWMSZ 

MKKRYLVLTALLALSLAACSQEKTKNEDGETKTEQTAKADGTVGSKSQGAAQKKAEVVNKGDYYSIQGKYDEIIVAN 
50 KHYPLSKDYNPGENPTAKAELVKLIKAMQEAGFPISDHYSGFRSYETQTKLYQDYVNQDGKAAADRYSARPGYSEHQT 
GLAFDVIGTDGDLVTEEKAAQWLLDHAADYGFVVRYLKGKEKETGYMAEEWHLRYVGKEAKEIAASGLSLEEYYGF 

EGGDYVDZ 

MREPDFLNHFLKKGYFKKHAKAVLALSGGLDSMFLFKVLSTYQKELEIELILAHVNHKQRIESDWEEKELRKLAAEAE 
55 LPIYISNFSGEFSEARARNFRYDFFQEVMKKTGATALVTAHHADDQVETIFMRLIRGTRLRYLSGIKEKQVVGEIEIIRPFL 
HFQKKDFPSIFHFEDTSNQENHYFRNRIRNSYLPELEKENPRFRDAILGIGNEILDYDLAIAELSNNINVEDLQQLFSYSES 
TQRVLLQTYLNRFPDLNLTKAQFAEVQQILKSKSQYRHPIKNGYELIKEYQQFQICKISPQADEKEDELVLHYQNQVAY 
QGYLFSFGLPLEGELIQQIPVSRETSIHIRHRKTGDVLIKNGHRKKLRRLFIDLKIPMEKRNSALIIEQFGEIVSILGIATNNL 

SKKTKNDIMNTVLYIEKIDRZ 

60 

MRKFLIILLLPSFLTISKVVSTEKEVVYTSKEIYYLSQSDFGIYFREKLSSPMVYGEVPVYANEDLVVESGKLTPKTSFQIT 
EWRLNKQGIPVFKLSNHQFIAADICRFLYDQSEVTPTIKKVWLESDFKLYNSPYDLKEVKSSLSAYSQVSIDKTMFVEGRE 
FLHIDQAGWVAKESTSEEDNRMSKVQEMLSEKYQKDSFSIYVKQLTTGKEAGINQDEKMYAASVLKLSYLYYTQEKIN 
EGLYQLDTTVKYVSAVNDFPGSYKPEGSGSLPKKEDNKEYSLKDLITKVSKESDNVAHNLLGYYISNQSDATFKSKMSA 
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IMGDDWDPKEKLISSKMAGKFMEAIYNQNGFVLESLTKTDFDSQRIAKGVSVKVAHKIGDADEFKHDTGVVYADSPFIL 
SIFTKNSDYDTISKIAKDVYEVLKZ 

MKKQNNGLIKNPFLWLLFIFFLVTGFQYFYSGNNSGGSQQINYTELVQEITDGNVKELTYQPNGSVIEVSGVYKNPKTSK 
5 EETGIQFFTPSVTKVEKFTSTILPADTTVSELQKLATDHKAEVTVKHESSSGIWINLLVSIVPFGILFFFLFSMMGNMGGG 
NGRNPMSFGRSKAKAANKEDIKVRFSDVAGAEEEKQELVEVVEFLKDPKRFTKLGARIPAGVLLEGPPGTGKTLLAKA 
VAGEAGVPFFSISGSDFVEMFVGVGASRVRSLFEDAKKAAPAIIFIDEIDAVGRQRGVGLGGGNDEREQTLNQLLIEMDG 
FEGNEGIIVIAATNRSDVLDPALLRPGRFDRKVLVGRPDVKGREAILKVHAKNKPLAEDVDLKLVAQQTPGFVGADLEN 
VLNEAALVAARRNKSIIDASDIDEAEDRVIAGPSKKDKTVSQKERELVAYHEAGHTIVGLVLSNARVVHKVTIVPRGRA 
10 GGYMIALPKEDQMLLSKEDMKEQLAGLMGGRVAEEUFNVQTTGASNDFEQATQMARAMVTEYGMSEKLGPVQYEG 
NHAMLGAQSPQKSISEQTAYEIDEEVRSLLNEARNKAAEIIQSNRETHKLIAEALLKYETLDSTQIKALYETGKMPEAVE 

EESHALSYDEVKSKMNDEKZ 

MKRSSLLVRMVISIFLVFLILLALVGTFYYQSSSSAIEATIEGNSQTTISQTSHFIQSYIKKLETTSTGLTQQTDVLAYAENP 
15 SQDKVEGIRDLFLTILKSDKDLKTVVLVTKSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQW 
VISVTQELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENHEFVYHPQHTVYSSSSKMEAMKPYIDTGQG 
YTPGHKSYVSQEKIAGTDWTVLGVSSLEKLDQVRSQLLWTLLGASVTSLLVCLCLVWFSLKRWIAPLKDLRETMLEIAS 
GAQNLRAKEVGAYELREVTRQFNAMLDQIDQLMVAIRSQEETTRQYQLQALSSQINPHFLYNTLDTIIWMAEFHDSQR 
VVQVTKSLATYFRLALNQGKDLICLSDEINHVRQYLFIQKQRYGDKLEYEINEhfVAFDNLVLPKLVLQPLVENALYHGI 
20 KEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDAGDSSQSQLKRGGVGLQNVDQRLKLHFGANYHMKIDSRPQKGTKV 

EIYINRIETSZ 

MKRSSLLVRMVISIFLVFLILLALVGTFYYQSSSSAIEATIEGNSQTTISQTSHFIQSYIKKLETTSTGLTQQTDVLAYAENP 
SQDKVEGIRDLFLTILKSDKDLKTVVLVTKSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQW 
25 VISVTQELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENHEFVYHPQHTVYSSSSKMEAMKPYIDTGQG 
YTPGHKSYVSQEKIAGTDWTVLGVSSLEKLDQVRSQLLWTLLGASVTSLLVCLCLVWFSLKRWIAPLKDLRETMLEIAS 
GAQNLRAKEVGAYELREVTRQFNAMLDQIDQLMVAIRSQEETTRQYQLQALSSQINPHFLYNTLDTIIWMAEFHDSQR 
VVQVTKSLATYFRLALNQGKDLICLSDEINHVRQYLFIQKQRYGDKLEYEINENVAFDNLVLPKLVLQPLVENALYHGI 
KEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDAGDSSQSQLKRGGVGLQNVDQRLKLHFGANYHMKIDSRPQKGTKV 

30 EIYINRIETSZ 

MFFKLLREALKVKQVRSKILFTIFIVLVFRIGTSITVPGVNANSLNALSGLSFLNMLSLVSGNALKNFSIFALGVSPYITASI 
VVQLLQMDILPKFVEWGKQGEVGRRKLNQATRYIALVLAFVQSIGITAGFNTLAGAQLIKTALTPQVFLTIGIILTAGSMI 
VTWLGEQITDKGYGNGVSMIIFAGIVSSIPEMIQGIYVDYFVNVPSSRITSSIIFVIILIITVLLIIYFTTYVQQAEYKIPIQYTK 
35 VAQGAPSSSYLPLKVNPAGVIPVIFASSITAAPAAILQFLSATGHDWAWVRVAQEMLATTSPTGIAMYALUILFTFFYTF 
VQINPEKAAETYKRVVPISMEFVLVKVQKNICLNFFVVLQLLVPSSLVZ 

MDIRQVTETIAMIEEQNFDIRTn'MGISLLDCIDPDINRAAEKIYQKrrTKAANLVAVGDEIAAELGIPIVNKRVSVTPISLIG 
AATDATDYVVLAKALDKAAKEIGVDFIGGFSALVQKGYQKGDEILINSIPRALAETDKVCSSVNIGSTKSGINMTAVAD 
40 MGRIIKETANLSDMGVAKLVVFANAVEDNPFMAGAFHGVGEADVIINVGVSGPGVVKRALEKVRGQSFDVVAETVKK 
TAFKITRIGQLVGQMASERLGVEFGIVDLSLAPTPAVGDSVARVLEEMGLETVGTHGTTAALALLNDQVKKGGVMAC 
NQVGGLSGAFIPVSEDEGMIAAVQNGSLNLEKI-EAMTAICSVGLDMIAIPEDTPAETIAAML\DEAAIGVINMKTTAVRII 

PKGKEGDMIEFGGLLGTAPVMKVNGASSVDFISRGGQIPAPIHSFKNZ 

45 MTQIIDGKALAAKLQGQLAEKTAKLKEETGLVPGLVVILVGDNPASQVYVRNKERSALAAGFR5EVVRVPETITQEELL 
DLIAKYNQDPAWHGILVQLPLPKHIDEEAVLLAIDPEKDVDGFHPLNMGRLWSGHPVMIPSTPAGIMEMFHEYGIDLEG 
KNAVVIGRSNIVGKPMAQLLLAKNATVTLTHSRTHNLSKVAAKADILVVAIGRAKFVTADFVKPGAVVIDVGMNRDEN 
GKLCGDVDYEAVAPLASHITPVPGGVGPMTITMLMEQTYQAALRTLDRKZ 

50 MSKFNRIHLVVLDSVGIGAAPDANNFVNAGVPDGASDTLGHISKTVGLNVPNMAKIGLGNIPRETPLKTVAAESNPTGY 
ATKLEEVSLGKDTMTGHWEIMGLNITEPFDTFWNGFPEEILTKIEEFSGRKVIREANKPYSGTAVIYDFGPRQMETGELII 
YTSADPVLQIAAHEDIIPLDELYRICEYARSITLERPALLGRIIARPYVGEPGNFTRTANRRDLAVSPFFPTVLDKLNEAGI 
DTYAVGKINDIFNGAGINHDMGHNKSNSHGIDTLLKTMGLAEFEKGFSFTNLVDFDALYGHRRNAHGYRDCLHEFDE 
RLPEIIAAMRENDLLLITADHGNDPTYAGTDHTREYIPLLAYSPAFKGNGLIPVGHFADISATVADNFGVETAMIGESFL 

55 DKLVZ 

MFISISAGIVTFLLTLVEIPAFIQFYRKAQITGQQMHEDVKQHQAKAGTPTMGGLVFLITSVLVAFFFALFSSQFSNNVGM 
ILFILVLYGLVGFLDDFLKVFRKINEGLNPKQKLALQLLGGVIFYLFYERGGDILSVFGYPVHLGFFYIFFALFWLVGFSN 
AVNLTDGVDGLASISVVISLSAYGVIAYVQGQMDILLVILAMIGGLLGFFIFNHKPAKVFMGDVGSLALGGMLAAISMA 
60 LHQEWTLLIIGIVYVFETTSVMMQVSYFKLTGGKRIFRMTPVHHHFELGGLSGKGNPWSEWKVDFFFWGVGLLASLLT 

LAILYLMZ 

LFKKNKDILNIALPAMGENFLQMLMGMVDSYLVAHLGLIAISGVSVAGNirriYQAIFIALGAAISSVISKSIGQKDQSKLA 
YHVTEALKITLLLSFLLGFLSIFAGKEMIGLLGTERDVAESGGLYLSLVGGSIVLLGLMTSLGALIRATHNPRLPLYVSFL 
65 SNALNILFSSLAIFVLDMGIAGVAWGTIVSRLVGLVILWSQLKLPYGKPTFGLDKELLTLALPAAGERLMMRAGDVVIIA 
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LVVSFGTEAVAGNAIGEVLTQFNYMPAFGVATATVMLLARAVGEDDWKRVASLSKQTFWLSLFLMLPLSFSIYVLGVP 
LTHLYTTDSLAVEASVLVTLFSLLGTPMTTGTVIYTAVWQGLGNARLPFYATSIGMWCIRIGTGYLMGIVLGWGLPGIW 
AGSLLDNGFRWLFLRYRYQRYMSLKGZ 

5 MQTQEKHSQAAVLGLQHLLAMYSGSILVPIMIATALGYSAEQLTYLISTDIFMCGVATFLQLQLNKYFGIGLPVVLGVA 
FQSVAPLIMIGQSHGSGAMFGALIASGIYVVLVSGIFSKVANLFPSIVTGSVITTIGLTLIPVAIGNMGNNVPEPTGQSLLLA 
AITVLIILLINIFTKGFIKSISILIGLVVGTAIAATMGLVDFSPVAVAPLVHVPTPLYFGMPTFEISSIVMMCIIATVSMVEST 
GVYLALSDITKDPIDSTRLRNGYRAEGLAVLLGGIFNTFPYTGFSQNVGLVKLSGIKKRLPIYYAAGFLVLLGLLPKFGA 
LAQIIPSSVLGGAMLVMFGFVSIQGMQILARVDFANNEHNFLIAAVSIAAGVGLNNSNLFVSMPTAFQMFFSNGIVVASL 

10 LAIVLNAVLNHKKKZ 

MKDRIKEYLQDKGKVTVNDLAQALGKDSSKDFRELIKTLSLMERKHQIRFEEDGSLTLEIKKKHEITLKGIFHAHKNGFG 
FVSLEGEEDDLFVGKNDVNYAIDGDTVEVVIKKVADRNKGTAAEAKIIDILEHSLTTVVGQIVLDQEKPKYAGYIRSKN 

15 QKISQPIYVKKPALKLEGTEVLKVFIDKYPSKKHDFFVASVLDVVGHSTDVGIDVLEVLESMDIVSEFPEAVVKEAESVP 
DAPSQKDMEGRLDLRDEITFTIDGADAKDLDDAVHIKALKNGNLEFGVHIADVSYYVTEGSALDKEALNRATSVYVTD 
RVVPMLPERLSNGICSLNPQVDRLTQSAIMEIDKHGRVVNYTITQTVIKTSFRMTYSDVNDILAGDEEKRKEYHKIVSSIE 
LMAKLHETLENMRVKRGALNFDTNEAKILVDfCQGKPVDIVLRQRGIAERMIESFMLMANETVAEHFSKLDLPFIYRIHE 
EPKAEKVQKFIDYASSFGLRIYGTASEISQEALQDIMRAVEGEPYADVLSMMLLRSMQQARYSEHNHGHYGLAADYYT 

20 HFTSPIRRYPDLLVHRMIRDYGRSKEIAEHFEQVIPEIATQSSNRERRAIEAEREVEAMKKAEYMEEYVGEEYDAVVSSIV 
KFGLFVELPNTVEGLIHITNLPEFYHFNERDLTLRGHKSGITFRVGQQIRIRVERADKMTGEIDFSFVPSEFDVIEKGLKQS 
SRSGRGRDSNRRSDKKEDKRKSGRSNDKRKHSQKDtCKKKGKKPFYKEVAKKGAKHGKGRGKGRRTKZ 

MGTTGFTIIDLIILIVYLLAVLVAGIYFSKKEMKGKEFFKGDGSVPWYVTSVSIFATMLSPISFLGLAGSSYAGSWILWFA 
25 QLGMVVAIPLTIRFILPIFARIDIDTAYDYLDKRFNSKALRIISALLFIIYQLGRMSIIMYLPSAGLSVLTGIDINILIILMGVV 
AIVYSYTGGLKSVLWTDFIQGVIUSGVVLALFVLIANIKGGFGAVAETLANGKFLAANEKLFDPNLLSNSIFLIVMGSGF 
TILSSYASSQDLVQRFTTTQNIKKLNKMLFTNGVLSLATATVFYLIGTGLYVFYQVQNADSAASNIPQDQIFMYFIAYQL 
PVGITGULAAIYAASQSTISTGLNSVATSWTLDIQDVISKNMSDNRRTKIAQFVSLAVGLFSIGVSIVMAHSDIKSAYEWF 
NSFMGLVLGLLGGVFILGFVSKKANKQGAYAALIVSTIVMVFIKYFLPPTAVSYWAYSUSISVSVVSGYIVSVLTGNKVS 
30 APKYTTIHDITEIKADSSWEVRHZ 

MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVnCITDQGYV 
TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQK 
QEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNM 
35 QPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHF 
IPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEVVSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGD 
HFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIK 
VRKNIZ 

40 MKKRAIVAVIVLLLIGLDQLVKSYIVQQIPLGEVRSWIPNFVSLTYLQNRGAAFSILQDQQLLFAVITLVVVIGAIWYLHK 
HMEDSFWMVLGLTLIIAGGLGNFIDRVSQGFVVDMFHLDFINFAIFNVADSYLTVGVIILLIAMLKEEINGNZ 

MNTNLASFIVGLIIDENDRFYFVQKDGQTYALAKEEGQHTVGDTVKGFAYTDMKQKLRLTTLEVTATQDQFGWGRVT 
EVRKDLGVFVDTGLPDKEIVVSLDILPELKELWPKKGDQLYIRLEVDKKDRIWGLLAYQEDFQRLARPAYNNMQNQN 
45 WPAIVYRLKLSGTFVYLPENNMLGFIHPSERYAEPRLGQVLDARVIGFREVDRTLNLSLKPRSFEMLENDAQMILTYLE 

SNGGFMTLNDKSSPDDIKATFGISKGQFKKALGGLMKAGKIKQDQFGTELIZ 

MKDVSLFLLKKVFKSRLNWIVLALFVSVLGVTFYLNSQTANSHSLESRLESRIAANERAINENEEKLSQMSDTSSEEYQF 
AKNNLDVQKNLLTRKTEILTLLKEGRWKEAYYLQWQDEEKNYEFVSNDPTASPGLKMGVDRERKIYQALYPLNIKAH 
50 TLEFPTHGIDQIVWILEVIIPSLFVVAIIFMLTQLFAERYQNHLDTAHLYPVSKVTFAISSLGVGVGYVTVLFIGICGFSFLV 
GSLISGFGQLDYPYPIYSLVNQEVTIGKIQDVLFPGLLLAFLAFIVIVEVVYLIAYFFKQKMPVLFLSLIGIVGLLFGIQTIQP 
LQRIAHLIPFTYLRSVEILSGRLPKQIDNVDLNWSMGMVLLPCLIIFLLLGILFIERWGSSQKKEFFNRFZ 

MMKFILDIVSTPAILVALIAILGLVLQKKKLPDIIKGGIKTFVGFLVVSGGAGIVQNSLNPFGTMFEHAFHLSGVVPNNEAI 
55 VAVALTTYGSATAMIMFAGMVFNILIARFTRFKYIFLTGHHTLYMACMIAVILSVAGFTSLPLILLGGLALGIIMSISPAF 
VQKYMVQLTGNDKVALGHFSSLGYWLSGFTGSLIGDKSKSTEDIKFPKSLAFLRDSTVSITLSMAVIYIIVAIFAGSEYIEK 
EISSGTSGLVYALQLAGQFAAGVFVILAGVRLILGEIVPAFKGISERLVPNSKPALDCPIVYTYAPNAVLIGFISSFVGGLVS 
MVIMIASGTVVILPGVVPHFFCGATAGVIGNASGGVRGATIGAFLQGILISFLPVFLMPVLGGLGFQGSTFSDADFGLSGII 
LGMLNQFGSQAGIVIGLVLILAVMFGVSFIKKPSATEEZ 

MIKTFLSALSVILFSIPIITYSFFPSSNLNIWLSTQPILAQIYAFPLATATMAAILSFLFFFLSFYKKNKQIRFYSGILLLLSLIL 
LLFGTDKTLSSASNKTKTLKLVTWNVANQIEAQHIERIFSHFDADMAIFPELATNIRGEQENQRIKLLFHQVGLSMANYD 
IFTSPPTNSGIAPVTVIVKKSYGFYTEAKTFHTTRFGTIVLHSRKQNIPDIIALHTAPPLPGLMEIWKQDLNIIHNQLASKYP 
KAIIAGDFNATMRHGALAKISSHRDALNALPPFERGTWNSQSPKLFNATIDHILLPKNHYYVKDLDIVSFQNSDHRCIFT 
65 EITFZ 
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MNPIQRSWAYVSRKRLRSFILFLILLVLLAGISACLTLMKSNKTVESNLYKSLNTSFSIKKIENGQTFKLSDLASVSKIKGL 
ENVSPELETVAKLKDKEAVTGEQSVERDDLSAADNNLVSLTALEDSSKDVTFTSSAFNLKEGRHLQKGDSKKILIHEEL 
AKKNGLSLHDKIGLDAGQSESGKGQTVEFEIIGIFSGKKQEKFTGLSSDFSENQVFTDYESSQTLLGNSEAQVSAARFYVE 
5 NPKEMDGLMKQVENLALENQGYQVEKENKAFEQIKDSVATFQTFLTIFLYGMLIAGAGALILVLSLWLRERVYEVGIL 
LALGKGKSSIFLQFCLEVVLVSLGALLPAFVAGNAITTYLLQTLLASGDQASLQDTLAKASSLSTSILSFAESYVFLVLLS 
CLSVALCFLFLFRKSPKEILSSISZ 

MLHNAFAYVTRKFFKSIVIFLIILLMASLSLVGLSIKGATAKASQETFKNITNSFSMQINRRVNQGTPRGAGNIKGEDIKKI 
10 TENKAIESYVKRINAIGDLTGYDLIETPETKKNLTADRAKRFGSSLMITGVNDSSKEDKFVSGSYKLVEGEHLTNDDKDK 
ILLHKDLAAKHGWKVGDKVKLDSNIYDADNEKGAKETVEVTIKGLFDGHNKSAVTYSQELYENTAITDIHTAAKLYGY 
TEDTAIYGDATFFVTADKNLDDVMKELNGISGINWKSYTLVKSSSNYPALEQSISGMYKMANLLFWGSLSFSVLLLALL 
LSLWINARRKEVGILLSIGLKQASILGQFITESIUAIPALVSAYFLANYTARAIGm'VLA>rVa'SGVAKQASKAAQASN^ 
GAEVDGFSKTLSSLDISIQTSDFIIIFVLALVLVVLVMALASSNLLRKQPKELLLDGEZ 

MSQDKQMKAVSPLLQRVINISSIVGGVGSUFCIWAYQAGILQSKETLSAFIQQAGIWGPPLFIFLQILQTVVPIIPGALTSV 

AGVFIYGHIIGTIYNYIGIVIGCAIIFYLVRLYGAAFVQSVVSKRTYDKYIDWLDKGNRFDRFFIFMMIWPISPADFLCMLA 

ALTKMSFKRYMTIIILTKPFTLVVYTYGLTYIIDFFWQMLZ 

20 MRNMWVVIKETYLRHVESWSFFFMVISPFLFLGISVGIGHLQGSSMAKNNKVAVVTTVPSVAEGLK^fVNGVNFDYKDE 
ASAKEAIKEEKLKGYLTIDQEDSVLKAVYHGETSLENGIKFEVTGTLNELQNQLNRSTASLSQEQEKRLAQTIQFTEKIDE 
AKENKKFIQTIAAGALGFFLYMILITYAGVTAQEVASEKGTKIMEVVFSSIRASHYFYARMMALFLVILTHIGIYVVGGL 
AAVLLFKDLPFLAQSGILDHLGDAISLNTLLFILISLFMYVVLAAFLGSMVSRPEDSGKALSPLMILIMGGFFGVTALGAA 
GDNLLLKIGSYIPFISTFFMPFRTINDYAGGAEAWISLAITVIFAVVATGFIGRMYASLVLQTDDLGIWKTFKRALSYKZ 

25 

MTETIKLMKAHTSVRRFKEQEIPQVDLNEILTAAQMASSWKNFQSYSVIVVRSQEKKDALYELVPQEAIRQSAVFLLFV 
GDLNRAEKGARLHTDTFQPQGVEGLLISSVDAALAGQNALLAAESLGYGGVIIGLVRYKSEEVAELFNLPDYTYSVFG 
MALGVPNQHHDMKPRLPLENVVFEEEYQEQSTEAIQAYDRVQADYAGARATTSWSQRLAEQFGQAEPSSTRKNLEQK 
KLLZMLKLIAIVGTNSKRSTNRQLLQYMQKHFTDKAEIELVEIKAIPVFNKPADKQVPAEILEIAAKIEEADGVIIGTPEYD 
30 HSIPAVLMSALAWLSYGIYPLLNKPIMITGASYGTLGSSRAQLQLRQILNAPEIKANVLPDEFLLSHSLQAFNPSGDLVDL 
DVnCKLDAIFDDFRIFVKITEKLRNAQELLRKDAEDFDWENLZ 

MOTYQLNNGVEIPVLGFGTFKAKDGEEAYRAVLEALKAGYRHIDTAAIYQNEESVGQAIXDSGVPREEMFVTTKLWNS 
QQTYEQTRQALEKSIEKLGLDYLDLYLIHWPNPKPLRENDAWKTRNAEVWRAMEDLYQEGKIRAIGVSNFLPHHLDAL 
35 LETATIVPAVNQVRLAPGVYQDQVVAYCREKGILLEAWGPFGQGELFDSKQVQEIAANHGKSVAQIALAWSLAEGFLP 
LPKSVTTSRIQANLDCFGIELSHEERETLKTIAVQSGAPRVDDVDFZ 

MRCKMLDPIAIQLGPLAIRWYALCIVTGLILAVYLTMKEAPRKKIIPDDILDFILVAFPLAILGARLYYVIFRFDYYSQNLG 
EIFAIWNGGLAIYGGLITGALVLYIFADRKLINTWDFLDIAAPSVMIAQSLGRWGNFFNQEAYGATVDNLDYLPGFIRDQ 
40 MYIEGSYRQPTFLYESLWNLLGFAULIFRRKWKSLRRGHITAFYLIWYGFGRMVIEGMRTDSLMFFGFRVSQWLSVVLI 

GLGIMIVIYQNRKKAPYYITEEENZ 

MGKLSSILLGTVSGAALALFLTSDKGKQVCSQAQDFLDDLREDPEYAKEQVCEKLTEVKEQATDFVLKTKEQVESGEIT 
VDSILAQTKSYAFQATEASKNQLNNLKEQWQEKAEALDDSEEIVIDITEEZ 

45 

MKTKLIFWGSMLFLLSLSILLTIYLAWIFYPMEIQWLNLTNRVYLKPETIQYNFHILMNYLTNPFSQVLQMPDFRSSAAG 

LHHFAVVKNLFHLVQLVALVTLPSFYVFVNRIVKKDFLSLYRKSLLALVVLPVMIGLGGVLIGFDQFFTLFHQILFVGD 

DTWLFDPAKDPVIMILPETFFLHAFLLFFALYENFFGYLYLKSRRKZ 

50 MTYHFTEEYDIIVIGAGHAGVEASLAASRMGCKVLLATINIEMLAFMPCNPSIGGSAKGIVVREVDALGGEMAKTIDKT 
YIQMKMLNTGKGPAVRALRAQADKELYSKEMRKTVENQENLTLRQTMIDEILVEDGKVVGVRTATHQEYAAKAVIVT 
TGTALRGEIIIGDLKYSSGPNHSLASINLADNLKELGLEIGRFKTGTPPRVKASSINYDVTEIQPGDEVPNHFSYTSRDEDY 
VKDQVPCWLTYTNGTSHEIIQNNLHRAPMFTGVVKGVGPRYCPSIEDKIVRFADKERHQLFLEPEGRNTEEVYVQGLST 
SLPEDVQRDLVHSIKGLENAEMMRTGYAIEYDMVLPHQLRATLETKKISGLFTAGQTNGTSGYEEAAGQGIIAGINAAL 

55 KIQGKPELILKRSDGYIGVMIDDLVTKGTIEPYRLLTSRAEYRLILRHDNADMRLTEMGREIGLVDDERWARFEIKKNQF 
DNEMKRLDSIKLKPVKETNAKVEEMGFKPLTDAVTAKEFLRRPEVSYQDVVAFIGPAAEDLDDKIIELIETEIKYEGYISK 
AMDQVAKMKRMEEKRIPANIDWDDIDSIATEARQKFKLINPETIGQASRISGVNPADISILMVYLEGKNRSISKTLQKSKZ 

MTKQVLLVDDEEHILKLLDYHLSKEGFSTQLVTNGRKALALAETEPFDFILLDIMLPQLDGMEVCKRLRAKGVKTPIM 
60 MVSAKSDEFDKVLALELGADDYLTKPFSPRELLARVKAVLRRTKGEQEGDDSDNIADDSWLFGTLKVYPERHEVYKA 
NKLLSLTPKEFESDKNPFFEVFKVSKVTAQZ 

MTTFKDGFLWGGAVAAHQLEGGWQEGGKGISVADVMTAGRHGVAREITLGVLEGKYYPNHEAIDFYHRYKEDIALF 
AEMGFKCFRTSIAWTRIFPKGDELEPNEEGLQFYDNLFDECLKNGIEPVITLSHFEMPYHLVTEYGGWKNRKUDFFARF 
65 AEVVFKRYKDKVKYWMTFNEINNQANYQEDFAPFTNSGIVYEEGDNREAIMYQAAHYELVASARAVKIGHEINPDFQI 
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GCMIAMCPIYPVTCNPKDILMAMKAMQKRYYFADVHVLGKYPEHIFKYWERKGISVDFTAQDKEDLLGGTVDYIGFS 
YYMSFAIDSHRENNPYFDYLETEDLVKNNYVKASEWEWQIDPEGLRYALNWFTDHYHLPLFIVENGFGAIDQVAADG 
MVHDDYRIEYLGAHIREMKKAVVEDGVDLMGYTPWGCIDLVSAGTGEMRKRYGFIYVDKDDNGKGSYNRSPKKSFG 
WYKEVISSNGESVEZ 

5 

MDQQNGLFGFLENHVMGPMGKLAQFKVVRAITAAGMAAVPFTIVGSMFLVFSILPQAFSFWPIVADIFSASFDKFTSLY 
MVANYATMGSLSLYFVLSLAYELTKIYAEEEELNMNPLNGALLALMAFVMTVPQIIFDGGMMKTVTSLKEGAVIADG 
WAMG>rVVARFGTTGIFTAIIMAIVTVLIYRMCVKHNWVIKMPEAVPEGVSRGFTALVPGFVVAFVVIFINGLLVAMGT 
10 DIFKVIAIPFGFVSNLTNSWIGLMIIYLLTQLLWIVGIHGANIVFAFVSPIALANMAENAAGGHFAVAGEFSNMFVIAGGS 
GATLGLCLYIAFASKSEQLKAIGRASVVPALFNINEPLIFGLPIIYNPALAIPFILAPMVTATIYYVANSLNFIKPIIAQVPWP 
TPVGIGAFLGTADLRAVLVALVCAFAAFLVYLPFIRVYDQKLVKEEQGIZ 

MKKFYVSPIFPILVGLIAFGVLSTFIIFVNNNLLTVLILFLFVGGYVFLFKKLRVHYTRSDVEQIQYVNHQAEESLTALLEQ 
15 MPVGVMKLNLSSGEVEWFNPYAELILTKEDGDFDLEAVQTIIKASVGNPSTYAKLGEKRYAVHMDASSGVLYFVDVSR 
EQAITDELVTSRPVIGIVSVDNYDDLEDETSESDISQINSFVANFISEFSEKHMMFSRRVSMDRFYLFTDYTVLEGLMNDK 
FSVIDAFREESKQRQLPLTLSMGFSYGDGNHDEIGKVALLNLNLAEVRGGDQVVVKENDETKNPVYFGGGSAASKRT 
RTRTRAMMTAISDKIRSVDQVFVVGHKNLDMDALGSAVGMQLFASNVIENSYALYDEEQMSPDIERAVSFIEKEGVTK 
LLSVKDAMGMVTNRSLLILVDHSKTALTLSKEFYDLFTQTIVIDHHRRDQDFPDNAVITYIESGASSASELVTELIQFQNS 
20 KKNRLSRMQASVLMAGMMLDTKNFTSRVTSRTFDVASYLRTRGSDSIAIQEIAATDFEEYREVNELILQGRKLGSDVLI 
AEAKDMKCYDTVVISKAADAMLAMSGIEASFVLAKNTQGFISISARSRSKLNVQRIMEELGGGGHFNLAAAQIKDVTLS 
EAGEKLTEIVLNEMKEKEKEEZ 

MKEKNMWKELLNRAGWILVFLLAVLLYQVPLVVTSILTLKEVALLQSGLIVAGLSIVVLALFIMGARJKTKLASFNFSFF 
25 RAKDLARLGI^YLVIVGSNILGSILLQLSNETTTANQSQINDMVQNSSUSSFFLLALLAPICEEILCRGIVPICKIFRGKENL 
GFVVGTIVFALLHQPSNLPSLLIYGGMSTVLSWTAYKTQRLEMSILLHMIVNGIAFCLLALVVIMSRTLGISVZ 

MKEKNMWKELLNRAGWILVFLLAVLLYQVPLVVTSILTLKEVALLQSGLIVAGLSIVVLALFIMGARKTKLASFNFSFF 
RAKDLARLGLSYLVIVGSNILGSILLQLSNETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGIVPKKIFRGKENL 
30 GFVVGTIVFALLHQPSNLPSLLIYGGMSTVLSWTAYKTQRLEMSILLHMIVNGIAFCLLALVVIMSRTLGISVZ 

MDTQKIEAAVKMIIEAVGEDANREGLQETPARVARMYQEIFSGLGQTAEEHLSKSFEIIDDNMVVEKDIFFHTMCEHHF 
LPFYGRAHIAYIPDGRVAGLSKLARTVEVYSKKPQIQERLNIEVADALMDYLGAKGAFVVIEAEHMCMSMRGVRKPGT 
ATLTTVARGLFETDKDLRDQAYRLMGLZMKDLFLKRKQAFRKECLGYLRYVLNDHFVLFLLVLLGFLAYQYSQLLQH 
35 FPENHWPILLFVGITSVLLLLWGGTATYMEAPDKLFLLVGEEEIKLHLKRQTGISLVFWLFVQTLFLLLFAPLFLAMGY 
GLPVFLLYVLLLGVGKYFHFCQKASKFFTETGLDWDYVISQESKRKQVLLRFFALFTQVKGISNSVKRRAYLDFILKAV 
QKVPGKIWQNLYLRSYLRNGDLFALSLRLLLLSLLAQVFIEQAWIATAVVVLFNYLLLFQLLALYHAFDYQYLTQLFPL 
DKGQKEKGLQEVVRGLTSFVLLVELVVGLITFQEKLALLALLGAGLVLLVLYLPYQVKRQMQDZ 

40 MRKSIVLAADNAYLIPLETTIKSVLYHNRDVDFYILNSDIAPEWFKLLGRKMEVVNSTIRSVHIDKELFESYKTGPHINYA 
SYFRFFATEVVESDRVLYLDSDIIVTGELATLFEIDLKGYSIGAVDDVYAYEGRKSGFNTGMLLMDVAKWKEHSIVNSL 
LELAAEQNQVVHLGDQSILNIYFEDNWLALDKTYNYMVGIDIYHLAQECERLDDNPPTIVHYASHDKPWNTYSISRLRE 
LWWVYRDLDWSEIAFQRSDLNYFERSNQSKKQVMLVTWSADIKHLEYLVQRLPDWHFHLAAPCDCSEELTSLSQYTN 
VTVYQNVLHSRIDWLLDDSIVYLDINTGGEVFNVVTRAQESGKKIFAFDITRKSMDDGLYDGIFSVERPDDLVDRMKNI 

45 EIEZ 

MTKIYSSIAVKKGLFTSFLLFIYVLGSRIILPFVDLNTKDFLGGSTAYLAFSAALTGGNLRSLSIFSVGLSPWMSAMILWQ 
MFSFSKRLGLTSTSIEIQDRRKMYLTLLIAVIQSLAVSLRLPVQSSYSAILVVLMNTILLIAGTFFLVWLSDLNASMGIGGSI 
VILLSSMVLNIPQDVLETFQTVHIPTGIIVLLALLTLVFSYLLALMYRARYLVPVNKIGLHNRFKRYSYLEIMLNPAGGMP 
50 YMYVMSFLSVPAYLFILLGFIFPNHSGLAALSKEFMVGKPLWVYVYISVLFLFSIIFAFVTMNGEEIADRMKKSGEYIYOr 
YPGADTSRFINRLVLRFSVIGGLFWIMAGGPMLFVLFDEKLLRLAMIPGLFMMFGGMIFTIRDEVKALRLNETYRPLIZ 

MSSLSDQELVAKTVEFRQRLSEGESLDDILVEAFAVVREADKRILGMFPYDVQVMGAIVMHYGNVAEMNTGEGKTLT 
ATMPVYLNAFSGEGVMVVTPNEYLSKRDAEEMGQVYRFLGLTIGVPFTEDPKKEMKAEEKKLIYASDIIYTTNSNLGFD 

55 YLNDNLASNEEGKFLRPFNYVIIDEIDDILLDSAQTPLIIAGSPRVQSNYYAIIDTLVTTLVEGEDYIFKEEKEEVWLTTKG 
AKSAENFLGIDNLYKEEHASFARHLVYAIRAHKLFTKDKDYIIRGNEMVLVDKGTGRLMEMTKLQGGLHQAIEAKEHV 
KLSPETRAMASITYQSLFKMFNKISGMTGTGKVAEKEFIETYNMSVVRIPTNRPRQRIDYPDNLYITLPEKVYASLEYIKQ 
YHAKGNPLLVFVGSVEMSQLYSSLLFREGIAHNVLNANNAAREAQIISESGQMGAVTVATSMAGRGTDIKLGKGVAEL 
GGLIVIGTERMESQRIDLQIRGRSGRQGDPGMSKFFVSLEDDVIKKFGPSWVHKKYKDYQVQDMTQPEVLKGRKYRKL 

60 VEKAQHASDSAGRSARRQTLEYAESMNIQRDIVYKERNRLIDGSRDLEDVVVDIIERYTEEVAADHYASRELLFHFIVTN 
ISFHVKEVPDYIDVTDKTAVRSFMKQVIDKELSEKKELLNQHDLYEQFLRLSLLKAIDDNWVEQVDYLQQLSMAIGGQS 
ASQKNPIVEYYQEAYAGFEAMKEQIHADMVRNLLMGLVEVTPKGEIVTHFPZ 

MIGTFAAALVAVLANFIVPIEITPNSANTEIAPPDGIGQVLSNLLLKLVDNPVNALLTANYIRILSWAVIFGIAMREASKNS 
65 QELLKTIADVTSKIVEWIINLAPFGILGLVFKTISDKGVGSLANYGILLVLLVTTMLFVAPVVNPLIAFFFMRRNPYPLVW 
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NCLRVSGVTAFFTRSSATNIPVNMKLCHDLGLNPDTYSVSIPLGSTINMAGVAITINLLTLAAVNTLGIPVDFATAFVLSV 
VAAISSCDASGIAGGSLLLIPVACSLFGISNDIAIQIVGVGFVIGVIQDSCETALNSSTDVLFTAVAEYAATRKKZ 

5 MSISQRTTKLILATCLACLLAYFLNLSSAVSAGIIALI^I^DTRRSTLKLARNRLFSMLLALAIGVLAFHLSGFHIWSLGLY 
LAFYVPLAYKMGWEIGITPSTVLVSHLLVQESTSPDLLVNEFLLFAIGTGFALLVNLYMPSREEEIQHYHTLVEEKLKDI 
LQRFKYYLSRGDGRNRAQLVAELDTLLKEALRLVYLDHSDHLFHQTDYHIHYFEMRQRQSRILRNMAQQINTCHLAAS 
ESLILAQLFSKIAGQLSQTNPASDLLDEIERYLEVFRNRSLPKTREEFETRATLLQLLREAKTFIQVKVDFYQKYRQZ 

10 MEIMSLAIAVFAVIIGLVIGYVSISAKMKSSQEAAELMLLNAEQEATNLRGQAEREADLLVNEAKRESKSLKKEALLEAK 
EEARKYREEVDAEFKSERQELKQIESRLTERATSLDRKDDNLTSKEQTLEQKEQSISDRAKNLDAREEQLEEVERQKEAE 
LERIGALSQAEARDIILAQTEENLTREIASRIREAEQEVKERSDKMAKDILVQAMQRIAGEYVAESTNSTVHLPDDTMKG 
RIIGREGRNIRTFESLTGVDVIIDDTPEVVTLSGFDPIRREIARMTMEMLLKDGRIHPARIEELVEKNRQEIDNKIREYGEA 
AAYEIGAPNLHPDLMKIMGRLQFRTSYGQNVLRHSIEVAKLAGIMASELGENAALARRAGFLHDIGKAIDHEVEGSHVE 

15 IGMELARKYKEPPVVVNTIASHHGDVEAESVIAVIVAAADALSAARPGARSESLESYIKRLHDLEEIANGFEGVQTSFAL 
QAGREIRIMVNPGKDCDDKVTILAHKVRKKIENNLDYPGNIKVTVIRELRAVDYAKZ 

MMLKPSIDTLLDKVPSKYSLVILEAKRAHELEAGAPATQGFKSEKSTLRALEEIESGNVTIHPDPEGKREAVRRRIEEEKR 
RKEEEEKKDCEQIAKEKEDGEKIZ 

20 

MSAYQLPTVWQDEASNQGAFTGLNRPTAGARFEQNLPKGEQAFQLYSLGTPNGVKVTILLEELLEAGFKEAAYDLYKI 

AIMDGDQFGSDFVKLNPNSKIPALLDQSGTENVRVFESAHILLYLAEKFGAFLPSNPVEKVEVLNWLFWQAGAAPFLG 

GGFGHFFNYAPEKLEYPINRFTMEVKRQLDLLDKELAQKPYIAGNDYTIADIAIWSWYGQLVQGNLYQGSAKFLDASS 

YQNLVKWAEKIANRPAVKRGLEVTYTEIKZ 

LASLITSIIMFYVGFDVLRDTIQKILSREETVIDPLGATLGIISAAIMFVVYLYNTRLSBOfCSNSNALK^^AAKDN 
GTAIAILASSFNYPIVDKLVAIIITFFILKTAYDIFIESSFSLSDGFDDRLLEDYQKAIMEIPKISKVKSQRGRTYGSNIYLDIT 
LEMNPDLSVFESHEIADQVESMLEERFGVFDTDVHIEPAPIPEDEILDNVYKKLLMREQLIDQGNQLEELLTDDFVYIRQ 
DGEQMDKEAYKTKKELNSAIKDIQITSISQKTKLICYELDGIIHTSIWRRHETWQNIFHQETKKEZ 

30 

MTIKLVATDMDGTFLDGNGRFDMDRLKSLLVSYKEKGIYFAVASGRGFLSLEKLFAGVRDDIIFIAENGSLVEYQGQDL 
YEATMSRDFYLATFEKLKTSPYVDINKLLLTGKKGSYVLDTVDETYLKVSQHYNENIQKVASLEDITDDIFKFTTNFTEE 
TLEDGEAWVNENVPGVKAMTTGFESIDIVLDYVDKGVAIVELVKKLGITMDQVMAFGDNLNDLHMMQVVGHPVAPE 

NARPEILELAKTVIGHHKERSVIAYMEGLZ 

35 

MADIKLIALDLDGTLLTTDKRLTDRTKETLQAARDRGIKVVLTTGRPLKAMDFFLHELGTDGQEDEYTITFNGGLVQK 
NTGEILDKTVFSYDDVARLYEETEKLSLPLDAISEGTVYQIQSDQESLYAKFNPALTFVPVDFEDLSSQMTYNKCVTAFA 
QEPLDAAIQKISPELFDQYEIFKSREMLLEWSPKNVHKATGLAKLISHLGIDQSQVMACGDEANDLSMIEWAGLGVAM 
QNAVPEVKAAANVVTPMTNDEEAVAWAIEEYVLKENZ 

MESLLILLLIANLAGLFLIWQRQDRQEKHLSKSLEDQADHLSDQLDYRFDQARQASQLDQKDLEVVVSDRLQEVRIELH 
QGLTQVRQEMTDNLLQTRDKTDQRLQALQESNEQRLEQMRQTVEEKLEKTLQTRLQASFETVSKQLESVNRGLGEMQ 
TVARDVGALNKVLSGTKTRGILGELQLGQIIEDIMTPAQYEREYATVENSSERVEYAIKLPGQGDQEYVYLPIDSKFPLA 
DYYRLEEAYETGDKDEIERCRKSLLASVKRFARDIRNKYIAPPRTTNFGVLFVPTEGLYSEIVRNPVFFDDLRREEQIIVA 
45 GPSTLSALLNSLSVGFKTLNIQKSADHISKTLASVKTEFGKFGGILVBCAQKHLQHASGNIDELLNRRTIAIERTLRHIELSE 
GEPALDLLHFQENEEEYEDZ 

MKISHMKKDELFEGFYLIKSADLRQTRAGKNYLAFTFQDDSGEIDGKLWDAQPHNIEAFTAGKVVHMKGRREVYNNT 
PQVNQITLRLPQAGEPNDPADFKVKSPVDVKEIRDYMSQMIFKIENPVWQRIVRNLYTKYDKEFYSYPAAKTNHHAFET 
50 GLAYHTATMVRLADAISEVYPQLNKSLLYAGIMLHDLAKVIELTGPDQTEYTVRGNLLGHIALIDSEITKTVMELGIDDT 
K£EVVLLRHVILSHHGLLEYGSPVRPRIMEAEIIHMIDNLDASMMMMSTALALVDKGEMTNKIFAMDNRSFYKPDLDZ 

MSEKAKKGFKMPSSYTVLLIILAIMAVLTWFIPAGAFIEGIYETQPQNPQGIWDVLMAPIRAMLGTHPEEGSLIKETSAAID 
VAFFILMVGGFLGIVNKTGALDVGIASIVKKYKGREKMLILVLMPLFALGGTTYGMGEETMAFYPLLVPVMMAVGFDS 
55 LTGVAIILLGSQIGCLASTLNPFATGIASATAGVGTGDGIVLRLIFWVTLTALSTWFVYRYADKIQKDPTKSLVYSTRKED 
LKHFNVEESSSVESTLSSKQKSVLFLFVLTFILMVLSFIPWTDLGVTIFDDFNTWLTGLPVIGNIVGSSTSALGTWYFPEG 
AMLFAFMGILIGVIYGLKEDKIISSFMNGAADLLSVALIVAIARGIQVIMNDGMITDTILNWGKEGLSGLSSQVFIVVTYIF 
YLPMSFLIPSSSGLASATMGIMAPLGEFVNVRPSLIITAYQSASGVLNLIAPTSGIVMGALALGRINIGTWWKFMGKLVVA 

IIVVTIALLLLGTFLPFLZ 

60 

MSNSFVKLLVSQLFANLADIFFRVTIIANIYIISKSVIATSLVPILIGISSFVASLLVPLVTKRLALNRVLSLSQFGKTILLAIL 
VGMFTVMQSVAPLVTYLFVVAISILDGFAAPVSYAIVPRYATDLGKANSALSMTGEAVQLIGWGLGGLLFATIGLLPTT 
CINLVLYIISSFLMLFLPNAEVEVLESETNLEILLKGWKLVARNPRLRLFVSANLLEIFSNTIWVSSIILVFVTELLNKTESY 
WGYSNTAYSIGIIISGLIAFRLSEKFLAAKWEPQLFTPNLKTIQNPCLSLDPGWFLFSPNGCFLLDKKEFPLYGISVEKNTK 
65 RKETHMNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYLVTNDQRRYCRYSDSDILVQFLFQLL 
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TGYGTDYACKELSADAYFPKLLEGGQLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDSTHFTTY 
GKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFITPVLERFNQLLFRMDSGFATPKLYDLIE 
KTGQYYLIKLKKNTVLSRLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNLFYDVISLVTNMTS 
GTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTLIKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFL 
5 HVVGKCVRTGRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMHZ 

MMEFFQQLPHLEPYGNPQYFVYVIAATLPIFIGLFFKKRFAWYEVLVSLFFIVTMLVGGKTNQLAALGIYLCWEILLLLF 
YKHYRKSKDGKWVFYLVSFLSLLPIIFVKVQPAINGTQSLLGFLGISYLTFRSVGIVIELRDGVIKDFTLWEFLRFLLFMPT 
FSSGPIDRFKRFNENYQAIPERDELMDMLDESVRYIMWGFLYKFILAHVLGETLLPPLKNLALQSGGFFNLYALAVMYT 
10 FGLELFFDFAGYSMFALAISNLMGIRSPINFNKPFLSRDLKEFWNRWHMSLSFWFRDFVFMRMVMVLTRKKVFKNRN 
VTSSMAYIVNMLIMGFWHGVTWYYIAYGLFHGLGLVINDAWVRKKKTLNKERKKAGKAALPENRWIQLLGMVVTFH 

VVMLSFLIFSGFLNNLWFKKZ 

MLKRLWMIFGPVLIAGLLVFLLIFFYPTEMHHNLGAEKRSAVATTIDSFKERSQKVRALSDPNVRFVPFFGSSEWLRFD 
15 GAHPAVLAEKYNRSYRPYLLGQGGAASLNQYFGMQQMLPQLENKQVVYVISPQWFSKNGYDPAAFQQYFNGDQLTS 
FLKHQSGDQASQYAATRLLQQFPNVAMKDLVQKLASKEELSTADNEMIELLARFNERQASFFGQFSVRGYVNYDKHV 
AKYLKILPDQFSYQAIEDVVKADAEKNTSNNEMGMENYFYNEQIKKDLKKLKDSQKSFTYLKSPEYNDLQLVLTQFSK 
SKVNPIFIIPPVNKKWMNYAGLREDMYQQTVQKIRYQLESQGFTNIADFSKDGGEPFFMKDTIHLGWLGWLAFDKAVD 

PFLSNPTPAPTYHLNERFFSKDWATYDGDVKEFQZ 

20 

MEKNLKALKQTTDQEGPAIEPEKAEDTKTVQNGYFEDAAVKDRTLSDYAGNWQSVYPFLEDGTFDQVFDYKAKLTG 
KMTQAEYKAYYTKGYHTDVTKINITDNTMEFVQGGQSKKYTYKYVGKKILTYKKGNRGVRFLFEATDADAGQFKYV 
QFSDHNVAPVKAEHFHIFFGGTSQEALFEEMDNWPTYYPDNLSGQEIAQEMLAHZ 

25 MKDGHLLAHHIRLLNGRIFQKLLSQDPEALYRGEQGKILAVLWNSETGCATATDIALATGLANNTLTTMIKKLEEQKL 
VIVSPCGKDKRKKYLVLTELGKSQKEVGHRVSQKLDTIFYKGFSEEEIHQFEGFQERILANLKEKGNEVZ 

MTNLIATFQDRFSDWLTALSQHLQLSLLTLLLAILLAIPLAVFLRYHEKLADWVLQIAGIFQTIPSLALLGLFIPLMGIGTL 
PALTALVIYAIFPILQNTITGLKGIDPNLQEAGIAFGMTRWERLKKFEIPLAMPVIMSGIRTAAVUIGTATLAAUGAGGL 
30 GSFILLGIDRNNASLILIGALSSAVLAIAFNFLLKVMEKAKLRTIFSGFALVALLLGLSYSPALLVQKEKENLVIAGKIGPEP 
EILANMYKLLIEENTSMTATVKPNFGKTSFLYEALKKGDIDIYPEFTGTVTESLLQPSPKVSHEPEQVYQVARDGIAKQD 
HLAYLKPMSYQNTYAVAVPKKIAQEYGLKTISDLKKVEGQLKAGFTLEFNDREDGNKGLQSMYGLNLNVATIEPALRY 
QAIQSGDIQITDAYSTDAELERYDLQVLEDDKQLFPPYQGAPLMKEALLKKHPELERVLNTLAGKITESQMSQLNYQVG 
VEGKSAKQVAKEFLQEQGLLKKZ 

35 

MMHTYLQKKIENIKTTLGEMSGGYRRMVAAMADLGFSGTMKAIWDDLFAHRSFAQWIYLLVLGSFPLWLELVYEHRI 

VDWIGMICSLTGIICVIFVSEGRASNYLFGLINSVIYLILALQKGFYGEVLTTLYFTVMQPIGLLVWIYQAQFKKEKQEFV 

ARKLDGKGWTKYLSISVLWWLAFGFIYQSIGANRPYRDSITDATNGVGQILMTAVYREQWIFWAATNVFSIYLWWGES 

LQIQGKYLIYLINSLVGWYQWSKAAKQNTDLLNZ 

40 

MRNMKAKYAVWVAFFLNLTYAIVEFIAGGVFGSSAVLADSVHDLGDAIAIGISAFLETISNREEDNQYTLGYKRFSLLG 
ALVTAVILVTGSVLVILENVTKILHPQPVNDEGILWLGIIAITINLLASLVVGKGKTKNESILSLHFLEDTLGWVAVILMAI 
VLRFTDWYILDPLLSLVISFFILSKALPRFWSTLKIFLDAVPEGLDIKQVKSGLERLDNVASLNQLNLWTMDALEKNAIV 

HVCLKEMEHMETCKESIRIFLKDCGFQNITIEIDADLETHQTHKRKVCDLERSYEHQHZ 

45 

MIEYKNVALRYTEKDVLRDVNLQIEDGEFMVLVGPSGSGKTTMLKMINRLLEPTDGNIYMDGKRIKDYDERELRLSTG 

YVLQAIALFPNLTVAENIALIPEMKGWSKEEITKKTEELLAKVGLPVAEYGHRLPSELSGGEQQRVGIVRAMIGQPKIFL 

MDEPFSALDAISRKQLQVLTKELHKEFGMTTIFVTHDTDEALKLADRIAVLQDGEIRQVANPETILKAPATDFVADLFG 

GSVHDZ 

50 

MSAVAISAMTKVMQETHGNPSSIHGHGRQAGKLLREARQELAQLLRTKPQHIFFTSGGTEGNNTTIIGYCLRHQEQGKH 
IITTAIEHHAVLETIDYLVQHFGFEATIIQPENQEITAQQIQKALRDDTILVSTMFVNNETGNLLPIAEIGQILKQHPAAYH 
VDAVQAIGKIPIHSEELGIDFLTASAHKFHGPKGIGFLYASSMDFDSYLHGGDQEQKKRAGTENLPAIVGMVAALKEDL 
EKQEEHFQHVQNLETAFLAELEGIQYYLNRGKHHLPYVLNIGFPGQKNDLLLLRLDLAGISISTGSACTAGVVQSSHVLE 

55 AMYGANSERLKESLRISLSPQNTVEDLQTLAKTLKEIIGGZ 



MLFKLSKEKIELGLSRLSPARRIFLSFALVILLGSLLLSLPFVQVESSRATYFDHLFTAVSAVCVTGLSTLPVAHTYNIWG 
QIICLLLIQIGGLGLMTFIGVFYIQSKQKLSLRSRATIQDSFSYGETRSLRKFVYSIFLTTFLVESLGAILLSFRLIPQLGWGR 
60 GLFSSIFLAISAFCNAGFDNLGSTSLFAFQTDLLVNLVIAGLIITGGLGFMVWFDLAGHVGRKKKGRLHFHTKLVLLLTI 
GLLLFGTATTLFLEWNNAGTIGNLPVADKVLVSFFQTVTMRTAGFSTIDYTQAHPVTLLIYILQMFLGGAPGGTAGGLK 
ITTFFVLLVFARSELLGLPHANVARRTIAPRTVQKSFSVFIIFLMSFLIGLILLGITAKGNPPFIHLVFETISALSTVGVTANL 
TPDLGKLALSVIMPLMFMGRIGPLTLFVSLADYHPEKKDMIHYMKADISIGZ 

65 
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MSDRTIGILGLGIFGSSVLAALAKQDMNIIAIDDHAERINQFEPVLARGVIGDITDEELLRSAGIDTCDTVVVATGENLESS 

VLAVMHCKSLGVPTVIAKVKSQTAKKVLEKIGADSVISPEYEMGQSLAQTILFHNSVDVFQLDKNVSIVEMKIPQSWAG 

QSLSKLDLRGKYNLNILGFREQENSPLDVEFGPDDLLKADTYILAVINNQYLDTLVALNSZ 

5 

MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLVEAS 

GRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSK 

GQYTMMHSLIYRTDLLRASQFZ 

10 

MKFNPNQRYTRWSIRRLSVGVASVVVASGFFVLVGQPSSVRADGLNPTPGQVLPEETSGTKEGDLSEKPGDTVLTQAKP 
EGVTGNTNSLPTPTERTEVSEETSPSSLDTLFEKDEEAQKNPELTDVLKETVDTADVDGTQASPAETTPEQVKGGVKEN 
TKDSIDVPAAYLEKAEGKGPFTAGVNQVIPYELFAGDGMLTRLLLKASDNAPWSDNGTAKNPALPPLEGLTKGKYFYE 
VDLNGNTVGKQGQALIDQLRANGTQTYKATVKVYGNKDGKADLTNLVATKNVDININGLVAKETVQJCAVADNVKDS 
15 IDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQV 
ALDGNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGUSKETVQKAVADNVKDSIDV 
PAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQLALD 
GNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKAVADNVKTVSMFQQP 
TZ 

20 

MKLKSYILVGYIISTLLTILVVFWAVQKMLIAKGEIYFLLGMTIVASLVGAGISLFLLLPVFTSLGKLKEHAKRVAAKDFP 
SNLEVQGPVEFQQLGQTFNEMSHDLQVSFDSLEESEREKGLMIAQLSHDIKTPITSIQATVEGILDGIIKESEQAHYLATIG 
RQTERLNKLVEELNFLTLNTARNQVETTSKDSIFLDKLLIECMSEFQFLIEQERRDVHLQVIPESARIEGDYAKLSRILVN 
25 LVDNAFKYSAPGTKLEVVAKLEKDQLSISVTDEGQGIAPEDLENIFKRLYRVETSRNMKTGGHGLGLAIARELAHQLGG 
ErrVSSQYGLGSTFTLVLNLSGSENKAZ 

MFGQTAQHGLTNSLKDFWIFLLNIGPQLAFFCQMLRCSRSVEQGTGNHRREFNMIQQIFSHFGMTHLGQIKLVYQESID 
30 LELLVNALNHHLLIDRLVLTPNQITIEIDRQIVHGLDLLKGRKDKEIIDIKSMFRQLELASTQQICPINQRVHHGILAFGEIS 
DLVPAKNLPNRQDZ 

MEHLATYFSTYGGAFFAALGIVLAVGLSGMGSAYGVGKAGQSAAALLKEQPEKFASALILQLLPGTQGLYGFVIGILIW 
35 LQLTPELPLEKGVAYFFVALPIAIVGYFSAKHQGNVAVAGMQILAKRPKEFMKGAILAAMVETYAILAFVVSFILTLRVZ 

MLKSEKQSRYQMLNEELSFLLEGETNVLANLSNASALIKSRFPNTVFAGFYLFDGKELVLGPFQGGVSCIRIALGKGVC 
GEAAHFQETVIVGDVTTYLNYISCDSLAKSEIVVPMMKNGQLLGVLDLDSSEIEDYDAMDRDYLEQFVAILLEKTAWD 

40 FTMFEEKSZ 

MSVLEIKDLHVEIEGKEILKGVNLTLKTGEIAAIMGPNGTGKSTLSAAIMGNPNYEVTKGEVLFDGVNILELEVDERAR 
MGLFLAMQYPSEIPGITNAEFLRAAMNAGKEDDEKISVREFITKLDEKMELLNMKEEMAERYLNEGFSGGEKKRNEIL 
45 QLLMLEPTFALLDEIDSGLDIDALKVVSKGVNAMRGEGFGAMIITHYQRLLNYITPDVVHVMMEGRVVLSGGPELAAR 
LEREGYAKLAEELGYDYKEELZ 

MPYKRQRSFSMALSKLDSLYMAVVADHSKNPHHQGKLEDAEQISLNNPTCGDVINLSVKFDAEDRLEDIAFLNSGCTIS 
50 TASASMMTDAVLGKTKQEILELATIFSEMVQGQKDERQDQLGDAAFLSGVAKFPQRIKCATLAWNALKKTIENQEKQZ 

MKIQDLLRKDVMLLDLQATEKTAVIDEMIKNLTDHGYVTDFETFKEGILAREALTSTGLGDGIAMPHSKNAAVKEATV 
LFAKSNKGVDYESLDGQATDLFFMIAAPEGANDTHLAALAELSQYLMKDGFADKLRQATSADQVIELFDQASEKTEEL 

55 VQAPANDSGDFIVAVTACTTGIAHTYMAQEALQKVAAEMGVGIKVETNGASGVGNQLTAEDIRKAKAIIIAADKAVEM 
DRFDGKPLINRPVADGIRKTEELINLALSGDTEVYRAANGAKAATASNEKQSLGGALYKHLMSGVSQMLPFVIGGGIMI 
ALAFLIDGALGVPNENLGNLGSYHELASMFMKIGGAAFGLMLPVFAGYVAYSIAEKPGLVAGFVAGAIAKEGFAFGKIP 
YAAGGEATSTLAGVSSGFLGALVGGFIAGALVLAIKKYVKVPRSLEGAKSILLLPLLGTILTGFVMLAVNIPMAAINTAM 
NDFLGGLGGGSAVLLGIVLGGMMAVDMGGPVNKAAYVFGTGTLAATVSSGGSVAMAAVMAGGMVPPLAIFVATLLF 

60 KDKFTKEERNSGLTNIIMGLSFITEGAIPFGAADPARAIPSFILGSAVAGGLVGLTGIKLMAPHGGIFVIALTSNALLYLVS 
VLVGAIVSGVVYGYLRKPQAZ 

MANKNTSTTRRRPSKAELERKEAIQRMLISLGIAILLIFAAFKLGAAGITLYNLIRLLVGSLAYLAIFGLLIYLFFFKWIRK 
65 QEGLLSGFFTIFAGLLLIFEAYLVWKYGLDKSVLKGTMAQVVTDLTGFRTTSFAGGGLIGVALYIPTAFLFSNIGTYFIGS 
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ILILVGSLLVSPWSVYDIAEFFSRGFAKWWEGHERRKEERFVKQEEKARQKAEKEARLEQEETEKALLDLPPVDMETGE 
ILTEEAVQNLPPIPEEKWVEPEIILPQAELKFPEQEDDSDDEDVQVDFSAKEALEYKLPSLQLFAPDKPKDQSKEKKIVRE 
NIKILEATFASFGIKVTVERAEIGPSVTKYEVKPAVGVRVNRISNLSDDLALALAAKDVRIEAPIPGKSLIGIEVPNSDIATV 
SFRELWEQSQTKjVENFLEIPLGKAVNGTARAFDLSKMPHLLVAGSTGSGKSVAVNGIIASILMKARPDQVKFMMVDPK 
5 MVELSVYNDIPHLLIPVVTNPRKASKALQKVVDEMENRYELFAKVGVRNIAGFNAKVEEFNSQSEYKQIPLPFIVVIVDE 
LADLMMVASKEVEDAIIRLGQKARAAGIHMILATQRPSVDVISGLIKANVPSRVAFAVSSGTDSRTILDENGAEKLLGRG 
DMLFKPIDENHPVRLQGSFISDDDVERIVNFIKTQADADYDESFDPGEVSENEGEFSDGDAGGDPLFEEAKSLVIETQKA 
SASMIQRRLSVGFNRATRLMEELEIAGVIGPAEGTKPRKVLQQZ 

10 

MSYFKKYKFDKSQFKLGMRTFKTGIAVFLVLLIFGFFGWKGLQIGALTAVFSLRESFDESVHFGTSRILGNSIGGLYALV 
FFLLNTFFHEAF\WTLVVVPICrMLTIMTNVAMNNKAGVIGGVAAMLIITLSIPSGETILYVFVRVLETFMGVFVAnVN 
YDIDRIRLFLEKKEKZ 

15 

MNKSEHRHQLIRALITKNKIHTQAELQALLAENDIQVTQATLSRDIKNMNLSKVREEDSAYYVLNNGSISKWEKRLELY 
MEDALVWMRPVQHQVLLKTLPGLAQSFGSIIDTLSFPDAIATLCGNDVCLIICEDADTAQKCFEELKKFAPPFFFEEZ 



20 MKSIKLNALSYMGIRVLNIIFPILTGTYVARVLDRTDYGYFNSVDTILSFFLPFATYGVYNYGLRAISNVKDNKKDLNRT 
FSSLFYLCIACTILTTAVYILAYPLFFTDNPIVKKVYLVMGIQLIAQIFSIEWVNEALENYSFLFYKTAFIRILMLVSIFLFVK 
NEHDIVVYTLVMSLSTLINYUSYFWIKRDIKLVKIHLSDFKPLFLPLTAMLVFANANMLFTFLDRLFLVKTGIDVNVSY 
YTIAQRIVTVIAGVVTGAIGVSVPRLSYYLGKGDKEAYVSLVNRGSRIFNFFIIPLSFGLMVLGPNAILLYGSEKYIGGGIL 
TSLFAFRTIILALDTILGSQILFTNGYEKRITVYTVFAGLLNLGLNSLLFFNHIVAPEYYLLTTMLSETSLLVFYIIFIHRKQL 

25 IHLGHIFSYTVRYSLFSLSFVAIYFLINFVYPVDMVINLPFLINTGLIVLLSAISYISLLVFTKDSIFYEFLNHVLALKNKFKK 

sz 



MELFMKITNYEIYKLKKSGLTNQQILKVLEYGENVDQELLLGDIADISGCRNPAVFMERYFQIDDAHLSKEFQKFPSFSIL 
30 DDCYPWDLSEIYDAPVLLFYKGNLDLLKFPKVAVVGSRACSKQGAKSVEKVIQGLENELVIVSGLAKGIDTAAHMAAL 
QNGGKTIAVIGTGLDVFYPKANKRLQDYIGNDHLVLSEYGPGEQPLKFHFPARNRIIAGLCRGVIVAEAKMRSGSLITCE 
RAMEEGRDVFAIPGSILDGLSDGCHHLIQEGAKLVTSGQDVLAEFEFZ 



35 MKQLTVEDAKQIELEILDYIDTLCKKHNINYIINYGTLIGAVRHEGFIPWDDDIDLSMPREDYQRFINIFQKEKSKYKLLS 
LETDKNYFNNFIKITDSTTKIIDTRNTKTYESGIFIDIFPIDRFDDPKVIDTCYKLESFKLLSFSKHKNIVYKDSLLKDWIRT 
AFWLLLRPVSPRYFANKIEKEIQKYSRENGQYMAFIPSKFKEKEVFPSGTFDKTIDLPFENLSLPAPEKFDTILTQFYGDY 
MTLPPEEKRFYSHEFHAYKLEDZ 

40 

MIKINHLTITQNKDLRDLVSDLTMTIQDGEKVAIIGEEGNGKSTLLKILMGEALSDFTIKGNIQSDYQSLAYIPQKVPEDL 
KKKTLHDYFFLDSIDLDYSILYRLAEELHFDSNRFASDQEIGNLSGGEALKIQLIHELAKPFEILFLDEPSNDLDLETVDW 
LKGQIQKTRQTVIFISHDEDFLSETADTIVHLRLVKHRKEAETLVEHLDYDSYSEQRKANFAKQSQQAANNQRAYDKT 
MEKHRRVKQNVETALRATKDSTAGRLLAKKMKTVLSQEKRYEKAAQSMTQKPLEEEQIQLFFSDIQPLPASKVLVQLE 
45 KENLSIDDRVLVQKLQLTVRGQEKIGIIGPNGVGKSTLLAKLQRLLNDKREISLGFMPQDYHKKLQLDLSPIAYLSKTGE 
KEELQKIQSHLASLNFSYPEMQHQIRSLSGGQQGKLLLLDLVLRKPNFLLLDEPTRNFSPTSQPQIRKLFATYPGGLITVS 

HDRRFLKEVCSIIYRMTEHGLKLVNLEDLZ 



50 MKPKTFYNLLAEQNLPLSDQQKEQFERYFELLVEWNEKINLTAITDKEEVYLKHFYDSIAPILQGLIPNETIKLLDIGAGA 
GFPSLPMKILYPELDVTIIDSLNKRINFLQLLAQELDLNGVHFYHGRAEDFAQDKNFRAQYDFVTARAVARMQVLSELT 
IPYLKVGGKLLALKASNAPEELLEAKNALNLLFSKVEDNLSYALPNRDPRYITVVEKKKETPNKYPRKAGMPNKRPLZ 



55 MSIKLJAVDIDGTLVNSQKEITPEVFSAIQDAKEAGVKVVIATGRPIAGVAKLLDDLQLRDEGDYVVTFNGALVQETATG 

heiisesltyedyldmeflsrklgvhmhaitkdgiytanrnigkytvhestlvsmprfyrtpeemagkeivkcmfidepei 

ldaaiekipaefyerysinksapfylellkknvdkgsaithlaeklgltkdetmaigdeendramlevvgnpvvmengn 

peikkiakyitktndesgvahairtwvlz 

60 

mtwiilgvialivifvivsynglvknrmqtkeawsqidvqlkrrndllpnlietvkgyakyegstlekvaelrnqvaaa 
tspaeamkasdaltrqvsgifavaesypdlkasanfvklqeeltntenkisysrqlynsvvsnynvkletfpsniiagmf 
gfkaadflqtpeeeksvpkvdfsglgdz 

65 
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MLFDQIASNKRKTWILLLVFFLLLALVGYAVGYLFIRSGLGGLVIALIIGFIYALSMIFQSTEIVMSMNGAREVDEQTAPD 
LYHVVEDMALVAQIPMPRVFIIDDPALNAFATGSNPQNAAVAATSGLLAIMNREELEAVMGHEVSHIRNYDIRISTIAV 
ALASAITMLSSMAGRMMWWGGAGRRRSDDDRDGNGLEIIMLVVSLLAIVLAPLAATLVQLAISRQREFLADASSVELT 
^ RNPQGMINALDKLDNSKPMSRHVDDASSALYINDPKKGGGFQKLFYTHPPISERIERLKQMZ 

MKLNIQEIRKQSEGLNFEQTLDLVDDLRARNQEILDVKDILAVGKVQYEDRMYFLDYQLSYTIVLASSRSMEPVELVES 
YPVTEVFMEGATNQLDQEVLDDDLVLPIENGELDLAESVSDNILLNIPIKVLTAEEEAGQGFISGNDWQIMTEEEYQAQ 
^ ^ KAVKKEENSPFAGLQGLFDGDEZ 

MKRQLALVVFSGGQDSTTCLFWVMQHYETVEAVTFAYGQRHHLEIQITREIAKEQGIRHHILDMSLLGQITAQPDFATI 
HISYIPDKLCVESKSLKLYLFSYRNHGDFHENCINTIGKDLVNLLDPRYLEVWGKFTPRGGISIDPYYNYGKQGTKYEGL 
^ ^ AEQRLFQHDLYPEKIDNRZ 

MTETVEDKVSHSITGLDILKGIVAAGAVISGTVATQTKVFTNESAVLEKTVEKTDALATNDTVVLGTISTSNSASSTSLSA 
SESASTSASESASTSASTSASTSASESASTSASTSISASSTVVGSQTAAATEATAKKVEEDRKKPASDYVASVTNVNLQSYA 
KRRKRSVDSIEQLLASIKNAAVFSGNTIVNGAPAINASLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYT 

20 VTYVNPKTNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQVLSYNTSTMTTQGSGYTWGNG 
AQMNGFFAKKGYGLTSSWTVPITGTDTSFTFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQSKSLSVSASQSASASAS 
TSASASASTSASASASTSASASASTSASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASE 
SASTSASASASTSASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTST 
SASASTSASESASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTS 

25 ASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASA 
STSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTSTSASASTSASESASTSASASASTSASA 
SASTSASASARQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQHQPVPRLQRQPVRQLQQVPVLQSQHQQVLQ 
PQHRQVPRLQQAHQHLNQRRQAPQLQQVPVRQPQRRQVRQPQQVLVHQLQHQRVHRLRRQPVHQSQQVPVRQLPHQ 
QVPRLQQAPVRRLQQVLAPQPQPQPVRQPQQVSQRLNRHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLNRHQRVRPL 

30 QQVLAPQPQRQQVHRLQHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLSQHQRVRQPQQAHQLLNLHQPVRQPQHRQ 
APQLQQVPVRQPQRRQVRRLQQVPVRQPQQVPVRQPQRRQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQH 
QPVHQSQQVPVRQFRINKCLGFSKYZ 

35 MGVETWFYSSICWLAIGLGSVWKFPYMTAANGGGGFLLIFLISTILIGFPLLLAEFALGRSAGVSAIKTFGKLGKNNKYN 
FIGWIGAFALFILLSFYSVIGGWILVYLGIEFGKLFQLGGTGDYAQLFTSIISNPAIALGAQAAFILLNIFIVSRGVQKGIERA 
SKVMMPLLFIVFVFIIGRSLSLPNAMEGVLYFLKPDFSKLTSTGLLYALGQSFFALSLGVTVMLTYASYLDKKTNLVQSG 
ISIVAMNISISIMAGLAIFQARSPFNIQSEGGPSLLFIVLPQLFDKMPFGTIFYVLFLLLFLFATVTFSVVMLEINVDNITNQD 
NSKRAKWSVILGILTFVFGIPSALSYGVMADVHIFGKTFFDAMDFLVSNLLMPFGALYLSLFTGYIFKKALAMEELHLD 

40 ERAWKQGLFQVWLFLLRFFVSSFQSSSLWSSLPNLCNQKGLEZ 

MLKKWQLKDVILLAFLSIFFGGVFVGSGYVYNILSLLLTPLGLQAFANEILFGLWCMAAPIAAIFVPRVGSATIGEVLAA 
LAEVLYGSQFGLGALLSGFVQGLGSEFGFIVTKNRYESWLSLTANSIGITLVSFVYEYIKLGYYAFSLPFVLSLLVVRFISV 

45 YFFCTILVRAIVKLYHQFATGGKAZ 

MVKVATQTPIISLFLLILSLETSFIPSIALTLSVVAFCILFMLYYRRFKMLAWMIILAILPSFANYWAVQLHGDASQAVML 
GTRAFVTVCIGLVFVSSVSLKELLLYLAQKGLSRSWSYALIVVFNSFPLIQQEIKSLKEACLLRGQELHFWSPLIYSKVLM 
50 TVFRWRHLYLRALSAHGYDEHAQLKNSYRTFYIPKKTKLIYLLFFLLLQTSLFLZ 

MRKHQLQVHKLTILSMMIALDVVLTPIFRIEGMAPMSSVVNILAGIMMGPVYALAMATVTAFIRMTTQGIPPLALTGAT 
FGALLAGLFYKYGRKFHYSALGEILGTGIIGSIVSYPVMVLFTGSAAKLSWFIYTPRFFGATLIGTAISFIAFRFLIKQEFFK 
55 KVQGYFFSERIDZ 

MQEFTNPFPIGSSSLIHCITNEISCEMLANGILALGCKPVMADDSREVLDFTKQSQALFINLGHLSAEKEKAIRMAASYAN 
QSSLPMVVDAVGVTTSSIRKSLVKDLLDYRPTVLKGNMSEIRSLVGLKHHGVGVDASAKDQETEDLLQVLKDWCQTYP 
60 GMSFLVTGPKDLVVSKNQVAVLGNGCTELDWITGTGDLVGALTAVFLSQGKTGFEASCLAVSYLNIAAEKIVVQGMG 
LEEFRYQVLNQLSLLRRDENWLDTIKGEVYEZ 

MNHKIAILSDVHGNATALEAVIADAKNQGASEYWLLGDIFLPGPGANDLVALLKDLPITASVRGNWDDRVLEALDGQ 
65 YGLEDPQEVQLLRMTQYLMERMDPATIVWLRSLPLLEKKEIDGLRFSISHNLPDKNYGGDLLVENDTEKFDQLLDAET 
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DVAVYGHVHKQLLRYGSQGQQIINPGSIGMPYFNWEALKNHRSQYAVIEVEDGELLNIQFRKVAYDYEAELELAKSKG 
LPFIEMYEELRRDDNYQGHNLELLASLIEKHGYVEDVKNFFDFLZ 



5 MNVNQIVRIIPTLKANNRKLNETFYIETLGMKALLEESAFLSLGDQTGLEKLVLEEAPSMRTRKVEGRKKLARLIVKVE 
NPLEIEGILSKTDSIHRLYKGQNGYAFEIFSPEDDLILIHAEDDIASLVEVGEKPEFQTDLASISLSKFEISMELHLPTDIESF 
LESSEIGASLDFIPAQGQDLTVDNTVTWDLSMLKFLVNELDIASLRQKFESTEYFIPKSEKFFLGKDRNNVELWFEEVZ 



10 MKWTKIIKKIEEQIEAGIYPGASFAYFKDNQWTEFYLGQSDPEHGLQTEAGLVYDLASVSKVVGVGTVCTFLWEIGQLD 
IDRLVIDFLPESDYPDITIRQLLTHATDLDPFIPNRDLLTAPELKEAMFHLNRJRSQPAFLYSDVHFLLLGFILERIFNQDLD 
VILKDQVWKPWGMTETKFGPVELAVPTVRGVEAGIVHDPKARLLGRHAGSAGLFSTIKDLQIFLEHYLADDFARDLNQ 
NFSPLDDKERSLAWNLEGDWLDHTGYTGTFIMWNRQKQEATIFLSNRTYEKDERAQWILDRNQVMNLIRKEEZ 

15 

MMKKTYNHILVWGVIFYSICIVCFCFTPQEQSTVGVGTPGIQHLGRLVFLLTPFNSLWKLGEVSDIGQLCWIFLQNILNV 
FLFFPLIFQLLYLFPNLRKTKKVLLFSFLVSLGIECTQLILDFFFDFNRVFEIDDLWTNTLGGYLAWLLYKRLHKNKVRN 

Z 

20 

MKIPLLTFARHKFVYVLLTLLFLALVYRDVLMTYFFFDIHAPDLAKFDGQAIKNDLLKSALDFRILQFNLGFYQSFIIPIII 

VLLGFQYIELKNKVLRLSIGREVSYQGLKRKLTLQVASIPCLIYLVTVLIIAIITYFFGTFSPLGWNSLFSDGSGLQRLLDGE 

IKSYLFFTCVLLIGIFINAIYFLQIVDYVGNVTRSAITYLMFLWLGSMLLYSALPYYMVPMTSLMQASYGDVSLMKLFTP 

YILYIVPYMVLEKYEDNVZ 

25 

MFKVLQKVGKAFMLPIAILPAAGLLLGIGGALSNPTTIATYPILDNSIFQSIFQVMSSAGEVVFSNLSLLLCVGLCIGLAKR 
DKGTAALAGVTGYLVMTATIKALVKLFMAEGSAIDTGVIGALVVGIVAVYLHNRYNNIQLPSALGFFGGSRFVPIVTSF 
SSILIGFVFFVIWPPFQQLLVSTGGYISQAGPIGTFLYGFLMRLSGAVGLHHIIYPMFWYTELGGVETVAGQTVVGAQKIF 
30 FAQLADLAHSGLFTEGTRFFAGRFSTMMFGLPAACLAMYHSVPKNRRKKYAGLFFGVALTSFITGITEPIEFMFLFVSPV 
LYVVHAFLDGVSFFIADVLNISIGNTFSGGVIDFTLFGILQGNAKTNWVLQIPFGLIWSVLYYIIFRWFITQFNVLTPGRGE 
EVDSKEISESADSTSNTADYLKQDSLQIIRALGGSNNIEDVDACVTRLRVAVKEVNQVDKALLKQIGAVDVLEVKGGIQ 

AIYGAKAILYKNSINEILGVDDZ 

35 MKFRKLACTVLAGAAVLGLAACGNSGGSKDAAKSGGDGAKTEITWWAFPVFTQEKTGDGVGTYEKSIIEAFEKANPDI 
KVKLETIDFKSGPEKITTAIEAGTAPDVLFDAPGRIIQYGKNGKLAELNDLFTDEFVKDVNNENIVQASKAGDKAYMYPI 
SSAPPYMAMNKKMLEDAGVANLVKEGW1TDDFEKVLKALKDKGYTPGSLFSSGQGGDQGTRAFISNLYSGSVTDEKV 
SKYTTDDPKFVKGLEKATSWIKDNLINNGSQFDGGADIQNFANGQTSYTILWAPAQNGIQAKLLEASKVEVVEVPFPSD 
EGKPALEYLVNGFAVFNNKDDKKVAASKKFIQFIADDKEWGPKDVVRTGAFPVRTSFGKLYEDKRMETISGWTQYYSP 

40 YYNTIDGFAEMRTLWFPMLQSVSNGDEKPADALKAFTEKANETIKKAMKQZ 



MQSTEKKPLTAFTVISTIILLLLTVLFIFPFYWILTGAFKSQPDTIVIPPQWFPKMPTMENFQQLMVQNPALQWMWNSVFI 
SLVTMFLVCATSSLAGYVLAKKRFYGQRILFAIFIAAMALPKQVVLVPLVRIVNFMGIHDTLWAVILPLIGWPFGVFLM 
45 KQFSENIPTELLESAKIDGCGEIRTFWSVAFPIVKPGFAALAIFTFINTWNDYFMQLVMLTSRNNLTISLGVATMQAEMA 
TNYGUMAGAALAAVPIVTVFLVFQKSFTQGITMGAVKGZ 



MKIMFKNFNNILLNRKIVLLLRIVLMMILINHLLSTAVQKQDAVIFFKRELISIFSYNDYSEANLEIPKLLLNLSLFMVGW 
50 LSVILLESDLADHYHHLIRYQSSSFFDYTRKRLVVISKFFTQDLFVWFLGLLPLGIHFKTVALFFLLAQLMMLYLLLSYLI 
ALISAGAGFSFFLYFLAFVGQEWMMDHIVTVYLVLLSLLVMLIVSRLEEKFKKGZ 



MGKGEMGKGVIGLEFDSEVLVNKAPTLQLANGKTATFLTQYDSKTLLFAVDKEDIGQEIIGIAKGSIESMHNLPVNLAG 
55 ARVPGGVNGSKAAVHEVPEFTGGVNGTEPAVHEIAEYKGSDSLVTLTTKKDYTYKAPLAQQALPETGNKESDLLASLG 

LTAFFLGLFTLGKKREQZ 



60 MKKTFFLLVLGLFCLLPLSVFAIDFKINSYQGDLYIHADNTAEFRQKIVYQFEEDFKGQIVGLGRAGKMPSGFDIDPHPKI 
QAAKNGAELADVTSEVTEEADGYTVRVYNPGQEGDIVEVDLVWNLKNLLFLYDDIAELNWQPLTDSSESIEKFEFHVR 
GDKGAEKLFFHTGKLFREGTIEKSNLDYTIRLDNLPAKRGVELHAYWPRTDFASARDQGLKGNRLEEFNKIEDSIVREK 
DQSKQLVTWVLPSILSISLLLSVCFYFIYRRKTTPSVKYAKNHRLYEPPMELEPMVLSEAVYSTSLEEVSPLVKGAGKFTF 
DQLIQATLLDVIDRGNVSIISEGDAVGLRLVKEDGLSSFEKDCLNLAFSGKKEETLSNLFADYKVSDSLYRRAKVSDEKR 

65 IQARGLQLKSSFEEVLNQMQEGVRKRVSFWGLPDYYRPLTGGEKALQVGMGALTILPLFIGFGLFLYSLDVHGYLYLPL 
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PILGFLGLVLSVFYYWKLRLDNRDGVLNEAGAEVYYLWTSFENMLREIARLDQAELESIVVWNRLLVYATLFGYADK 
VSHLMKVHQIQVENPDINLYVAYGWHSTFYHSTAQMSHYASVANTASTYSVSSGSGSSGGGFSGGGGGGSIGAFZ 



5 MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRTEAQTG 
EAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTPYQIIKVDGS 
EKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYEQKDKSVPLDVVILLDNSNSMSNIRNKNA 
RRAERAGEATRSLIDKITSDSENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNTKDYSYLKL 
TNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQARQNSQKVIFHITDGVPTMSYPINFNHAT 

10 FAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQMFTDKTVYEKGAPAAFPVKPEKYSEMKAAGYA 
VIGDPINGGYIWLNWRESILAYPFNSNTAKITNHGDPTRWYYNGNIAPDGYDVFTVGIGINGDPGTDEATATSFMQSISS 
KPENYTNVTDTTKILEQLNRYFHTIVTEKKSIENGTITDPMGELIDLQLGTDGRFDPADYTLTANDGSRLENGQAVGGP 
QNDGGLLKNAKVLYDTTEKRIRVTGLYLGTDEKVTLTYNVRLNDEFVSNKFYDTNGRTTLHPKEVEQNTVRDFPIPKI 
RDVRKYPErriSKEKKLGDIEFIKVNKNDKKPLRGAVFSLQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDG 

15 KYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYUG 
CMMMGGVLLYTRKHP2 



MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLLATDGDMDKIANELETGNYAGNKVGVLPANAKEIAG 
20 VMFVWTNTTNNEIIDENGQTLGVNIDPQTFKLSGAMPATAMKKLTEAEGAKFNnrANLPAAKYKIYEIHSLSTYVGEDGA 
TLTGSKAVPIEIELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIVTKIPALANYA 
TANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLKLTDAGLAKVNDQNAEKTVKITYSATLNDKAI 
VEVPESNDVTFNYGNNPDHGNTPKPNKPNENGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTGKVVQTVTLTTDKN 
TVTVNGLDKNTEYKFVERSIKGYSADYQEITTAGEIAVKNWKDENPKPLDPTEPKVVTYGKKFVKVNDKDNRLAGAEF 
25 VIANADNAGQYLARK^DKVSQEEKQLVVTTKDALDRAVAAYNALTAQQQTQQEKEKVDKAQAAYNAAVIAANNAF 
EWVADKDNENVVKLVSDAQGRFEITGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSYSATGQGIEYTAGSGKDDAT 
KVVNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLAZ 



30 MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAKKTDTM 
TTKVKLIKVDQDHNRLEGVGFKLVSVARDVSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGNYRFKEVEPL 
AGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVLQNGKEVVVTS 
GKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGEETLVYLDACCHFV 

35 vwz 



MSHIYLSIFTSLLLMLGLVNVAQADEYLRIGMEAAYAPFNWTQDDDSNGAVKIDGTNQYANGYDVQIAKKIAKDLGKE 
PLVVKTKWEGLVPALTSGKIDMIIAGMSPTAERKQEIAFSSSYYTSEPVLLVKKDSAYASAKSLDDFNGAKITSQQGVYL 
40 YNLIAQIPGAKKETAMGDFAQMRQALEAGVIDAYVSERPEALTAEAANSKFKMIQVEPGFKTGEEDTAIAIGLRKNDNR 
ISQINASIETISKDDQVALMDRMIKEQPAEATTTEETSSSFFSQVAKILSENWQQLLRGAGITLLISIVGTnGLIIGLAIGVFR 
TAPLSENKVIYGLQKLVGWVLNVYIEIFRGTPMIVQSMVIYYGTAQAFGINLDRTLAAIFIVSINTGAYMTEIVRGGILAV 
DKGQFEAATALGMTHNQTMRKIVLPQVVRNILPATGNEFVINIKDTSVLNVISVVELYFSGNTVATQTYQYFQTFTIIAV 
lYFVLTFTVTRILRFIERRMDMDTYTTGANQMQTEDLKZ 

45 

MTQAILEIKHLKKSYGQNEVLKDISLTVHKGEVISIIGSSGSGKSTFLRSINLLETPTDGQILYHGQNVLEKGYDLTQYREK 
LGMVFQSFNLFENLNVLEOTIVAQTTVLKRERTEAEKIAKENLEKVGMGERYWQAKPKQLSGGQKQRVAIARALSMN 
PDAILFDEPTSALDPEMVGEVLKIMQDLAQEGLTMIVVTHEMEFARDVSHRVIFMDKGVIAEEGKPEDLFTNPKEDRTK 
50 EFLQRYLKZ 



MKKYQLLFKISAVFSYLFFVFSLSQLTLIVQNYWQFSSQIGNLFWIQNILSLLFIGVMIVVLVKTGHGYLFRIPRKKWLW 
YSILTVLVLVFQISFNVQTAKHVQSTAEGWAVLIGYSGTNFAELGIYIALFFLVPLMEELIYRGLLQHAFFKHSRFGLDLL 
55 LPSILFALPHFSSLPSLLDIFVFATVGIIFAGLTRYTKSIYPSYAVHVINNIVATFPFLLTFLHRVLGZ 



MNKKQWLGLGLVAVAAVGLAACGNRSSRNAASSSDVKTKAAIVTDTGGVDDKSFNQSAWEGLQAWGKEHNLSKDN 
GFTYFQSTSEADYANNLQQAAGSYNLIFGVGFALNNAVKDAAKEHTDLNYVLIDDVIKDQKNVASVTFADNESGYLA 
60 GVAAAKTTKTKQVGFVGGIESEVISRFEAGFKAGVASVDPSIKVQVDYAGSFGDAAKGKTIAAAQYAAGADIVYQVAG 
GTGAGVFAEAKSLNESRPENEKVWVIGVDRDQEAEGKYTSKDGKESNFVLVSTLKQVGTTVKDISNKAERGEFPGGQV 
IVYSLKDKGVDLAVTNLSEEGKKAVEDAKAKILDGSVKVPEKZ 
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MSKKLQQISVPLISVFLGILLGAIVMWIFGYDAIWGYEELFYTAFGSLRGIGEIFRAMGPLVLIGLGFAVASRAGFFNVGL 
PGQALAGWILSGWFALSHPDMPRPLMILATIVIALIAGGIVGAIPGILRAYLGTSEVIVTIMMNYIVLYVGNAFIHAFPKD 
FMQSTDSTIRVGANATYQTPWLAELTGNSRMNIGIFFAIIAVAVIWFMLKKTTLGFEIRAVGLNPHASEYAGISAKRTIIL 
SMIISGALAGLGGAVEGLGTFQNVYVQGSSLAIGFNGMAVSLLAANSPIGILFAAFLFGVLQVGAPGMNAAQVPSELVSI 
5 VTASIIFFVSVHYLIERFVKPKKQVKGGKZ 



MGVKKKLKLTSLLGLSLLIMTACATNGVTSDITAESADFWSKLVYFFAEIIRFLSFDISIGVGIILFTVLIRTVLLPVFQVQ 
MVASRKMQEAQPRIKALREQYPGRDMESRTKLEQEMRKVFKEMGVRQSDSLWPILIQMPVILALFQALSRVDFLFCTGH 
10 FLWINLGSVDTTLVLPILAAVFTFLSTWLSNKALSERNGATTAMMYGIPVUFIFAVYAPGGVALYWTVSNAYQVLQTY 
FLNNPFKIIAEREAVVQAQKDLENRKRKAKKKAQKTKZ 



MVIDPFAINELDYYLVSHFHSDHIDPYTAAAILNNPKLEHVKFIGPYHCGRIWEGWGVPKERIIVVKPGDTIELKDMKIH 
15 AVESFDRTCLVTLPVNGADETGGELAGLAVTDEEMAQKAVNYIFETPGGTIYHGADSHFSNYFAKHGKDFKIDVALNN 
YGENPVGIQDKMTSIDLLRMAENLRTKVIIPVHYDIWSNFMASTNEILELWKMRKDRLQYDFHPFIWEVGGKYTYPQD 
QHLVEYHHPRGFDDCFEQDSNIQFKALLZ 



20 MFLSGWLSSFANTYIHDLLGVLFPDSPFLNAFESAIAAPLVEEPLKLLSLVFVLALIPVRKLKSLFLLGIASGLGFQMIKDI 
GYIRTDLPEGFDFTISRILERIISGIASHWTFSGLAVVGVYLLYRAYKGQKVGKKQGLIFLGLALGTHFLFNSPFVELETEL 
PLAIPVVTAIALYGFYHAYCFVEKHNELMTZ 



25 MKVEPRCDVLSRMSHFFIRILIMELQELVERSWAIRQAYHELEVKHHDSKWTVEEDLLALSNDIGNFQRLVMTKQGRY 
YDETPYTLEQKLSENIWWLLELSQRLDIDILTEMENFLSDKEKQLNVRTWKZ 



MLDWKQFFLAYLRSRSRLFIYLLSLAFLVLLFQFLFASLGIYFLYFFFLCCFVTILFFTWDILVETQVYRQELLYGEREAK 
30 SPLEIALAEKLEAREMELYQQRSKAERKLTDLLDYYTLWVHQIKTPIAASQLLVAEVVDRQLKQQLEQEIFKIDSYTNLV 
LQYLRLESFHDDLVLKQVQIEDLVKEIIRKYALFFIQKGLNVNLHDLDKEIVTDKKWLLVVIEQIISNSLKYTKEGGLEIY 
MDDQELCIKDTGIGIKNSDVLRVFERGFSGYNGRLTQQSSGLGLYLSKKISEELGHQIRIESEVGKGTTVRIQFAQVNLVL 
EZ 

35 

MELNTHNAEILLSAANKSHYPQDELPEIALAGRSNVGKSSFINTMLNRKNLARTSGKPGKTQLLNFFNIDDKMRFVDVP 
GYGYARVSKKEREKWGCMIEEYLTTRENLRAVVSLVDLRHDPSADDVQMYEFLKYYEIPVIIVATKADKIPRGKWNKH 
ESAIKKKLNFDPSDDFILFSSVSKAGMDEAWDAILEKLZ 

40 

MTKKQLHLVIVTGMSGAGKTVAIQSFEDLGYFTIDNMPPALLPKFLQLVEIKEDNPKLALVVDMRSRSFFSEIQAVLDEL 
ENQDGLDFKILFLDAADKELVARYKETRRSHPLAADGRILDGIKLERELLAPLKNMSQNVVDTTELTPRELRKTLAEQF 
SDQEQAQSFRIEVMSFGFKYGIPIDADLVFDVRFLPNPYYLPELRNQTGVDEPVYDYVMNHPESEDFYQHLLALIEPILP 
SYQKEGKSVLTIAMGCTGGQHRSVAFAKRLAQDLSKNWSVNEGHRDKDRRKETVNRSZ 

45 

MRKPKITVIGGGTGSPVILKSLREKDVEIAAIVTVADDGGSSGELRKNMQQLTPPGDLRNVLVAMSDMPKFYEKVFQYR 

FSEDAGAFAGHPLGNLIIAGLSEMQGSTYNAMQLLSKFFHTTGKIYPSSDHPLTLHAVFQDGTEVAGESHIVDHRGIIDN 

VYVTNALNDDTPLASRRVVQTILESDMIVLGPGSLFTSILPNIVIKEIGRALLETKAEIAYVCNIMTQRGETEHFTDSDHV 

50 EVLHRHLGRPFIDTVLVNIEKVPQEYMNSNRFDEYLVQVEHDFVGLCKQVSRVISSNFLRLENGGAFHDGDLIVDELMR 
I 

IQVBCKZ 



55 MKNLIKLLIIRLIVNLADSVFYIVALWHVSNNYSSSMFLGIFIAVNYLPDLLLIFFGPVIDRVNPQKILIISILVQLAVAVIFL 
LLLNQISFWVIMSLVFISVMASSISYVIEDVLIPQVVEYDKIVFANSLFSISYKVLDSIFNSFASFLQVAVGFILLVKIDIGIFL 
LALFILLLLKFRTSNANIENFSFKYYKREVLQGTKFILNNKLLFKTSISLTLINFFYSFQTVVVPIFSIRYFDGPIFYGIFLTLA. 
GLGGILGNMLAPIVIKYLKSNQIVGVFLFLNGSSWLVAIVIKDYTLSLILFFVCFMSKGVFNIIFNSLYQQIPPHQLLGRVN 
TTIDSIISFGMPIGSLVAGTUDLNIELVLIAISIPYFLFSYIFYTDNGLKEFSIYZ 

60 

MMSNKNKEILIFAILYTVLFMFDGVKLLASLMPSAIANYLVYVVLALYGSFLFKDRLIQQWKEIRKTKRKFFFGVLTGW 

LFLILMTVVFEFVSEMLKQFVGLDGQGLNQSNIQSTFQEQPLLIAVFACVIGPLVEELFFRQVLLHYLQERLSGLLSIILV 

GLVFALTHMHSLALSEWIGAVGYLGGGLAFSIIYVKEKENIYYPLLVHMLSNSLSLIILAISIVKZ 

65 
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LKKPIIEFKNVSKVFEDSNTKVLKDINFELEEGKFYTLLGASGSGKSTILNIIAGLLDATTGDIMLDGVRINDIPTNKRDVH 
TVFQSYALFPHMNVFENVAFPLRLRKIDKKEIEQRVAEVLKMVQLEGYEKRSIRKLSGGQRQRVAIARAIINQPRVVLLD 
EPLSALDLKLRTDMQYELRELQQRLGITFVFVTHDQEEALAMSDWIFVMNDGEIVQSGTPVDIYDEPINHFVATFIGESN 
5 ILPGTMIEDYLVEFNGKRFEAVDGGMKPNEPVEVVIRPEDLRITLPEEGKLQVKVDTQLFRGVHYEIIAYDELGNEWMI 
HSTRKAIVGEEIGLDFEPEDIHIMRLNETEEEFDARIEEYVEIEEQEAGLINAIEEERDEENKLZ 



MKSMRILFLLALIQISLSSCFLWKECILSFKQSTAFFIGSMVFVSGICAGVNYLYTRKQEVHSVLASKKSVKLFYSMLLLIN 
10 LLGAVLVLSDNLFIKNTLQQELVDFLLPSFFFLFGLDLLIFLPLKKYVRDFLAMLDRKKTVLVTILATLLFLRNPMTIVSL 
LIYIGLGLFFAAYLVPNSVKKEVSFYGHIFRDLVLVIVTLIFFZ 



MVKKIIGMVLALLSVTVVGVGVFAYTIYQQGTETLAKTYKKIGEETKVIEATEPLTILLMGVDTGNVERTETWVGRSDS 
15 MILMTVNPKTKKTTMMSLERDILTRIESGNGQAHEAKLNSAYADGGAELAIETIQKMMNIHIDRYVMVNMRGLQKLV 
DAVGGITVNNILGFPISISDQEEFNTISIGVGEQHIGGEEALVYARMRYQDPEGDYGRQKRQREVIQKVMEKALSLNSIGH 
YQEILKALSDNMQTNIDLSAKSIPNLLGYKDSFKTIETQQLQGEGEILQGVSYQIVSRAHMLEMQNLLRRSLGQEEVTQL 
ETNAVLFEDLFGRAPVGDEDNZ 

20 

MKKQAYVIIALTSFLFVFFFSHSLLEILDFDWSIFLHDVEKTEKFVFLLLVFSMSMTCLLALFWRGIEELSLRKMQANLK 
RLLAGQEVVQVADPDLDASFKSLSGKLNLLTEALQKAENQSLAQEEEIIEKERKRIARDLHDTVSQELFAAHMILSGISQ 
QALKLDREKMQTQLQSVTAILETAQKDLRVLLLHLRPVELEQKSLIEGIQILLKELEDKSDLRVSLKQNMTKLPKKIEEHI 
FRILQELISNTLRHAQASCLDVYLYQTDVELQLKVVDNGIGFQLGSLDDLSYGLRNIKERVEDMAGTVQLLTAPKQGLA 
25 VDIRIPLLDKEZ 



MIVSIISQGFVWAILGLGIFMTFRILNFPDMTTEGSFPLGGAVAVTLITKGVNPFLATLVAVGAGCLAGMAAGLLYTKGK 
IPTLLSGILVMTSCHSIMLLIMGRANLGLLGTKQIQDVLPFDSDLNQLLTGLIFVSIVIALMLFFLDTKLGQAYIATGDNP 
30 DMARSFGIHTGRMELMGLVLSNGVIALAGALIAQQEGYADVSRGIGVIVVGLASLIIGEVIFKSLSLAERLVTIVVGSIAY 
QFLVWAVIALGFNTSYLRLYSALILAVCLMIPTFKQTILKGAKLSKZ 



MKKMKVWSTVLATGVALTTLAACSGGSNSTTASSSEEKADKSQELVIYSNSVSNGRGDWLTAKAKEAGFNIKMVDIAG 
35 AQLADRVIAEKNNAVADMVFGIGAVDSNKIRDQKLLVQYKPKWLDKIDQSLSDKDNYYNPVIVQPLVLIGAPDVKEMP 
KDWTELGSKYKGKYSISGLQGGTGRAILASILVRYLDDKGELGVSEKGWEVAKEYLKNAYTLQKGESSIVKMLDKEDPI 
QYGMMWGSGALVGQKEQNVVFKVMTPEIGVPFVTEQTMVLSTSKKQALAKEFIDWFGQSEIQVEYSKNFGSIPANKD 
ALKDLPEDTKKFVDQVKPQNIDWEAVGKHLDEWVEKAELEYVQZ 

40 

MIKFDNIQIKYGDFVAIDNLNLDIHEGEFFTFLGPSGCGKSTTLRALVGFLDPSSGSIEVNGTDVTHLEPEKRGIGIVFQSY 
ALFPTMTVFDNIAFGLKVKKVAPDVIKAKVSAVAAKIKISDQQLQRNVSELSGGQQQRVALARALVLEPKILCLDEPLS 
NLDAKLRVDLRKELKRLQKELGITTLYVTHDQEEALTLSDRIAVFNNGYIEQVGTPVEIYHNSQTEFVCDFIGDINVLTD 
ETVHEVLLKNTSVFLEDKKGYIRLEKVRFNRETEQDFILKGTIIDVEFSGVTIHYTIKVSESQILNVTSIDSQAAIRSVGESV 
45 ELFITPSDVLQFZ 



MRHKLNLKDWLIRLGLIWFLVTFIIYPNFDLVVNVFVKGGEFSLDAVHRVLKSQRALQSIMNSFKLAFSLIITVNVVGIL 
CVLFTEYFDIKGAKILKLGYMTSLIYGGVVLATGYKFVYGPYGLITKFLQNVIPSLDPNWFIGYGAVLFIMTFSGTANHT 
50 LFLTNTIRSVDYHTIEAARNMGAKPFTVFRKVVLPTLIPTLFALTIMVFLSGLSAVAAPMIVGGKEFQTINPMIITFAGMG 
NSRDLAALLAIILGIATTILLTIMNKIEKGGNYISISKTKAPLKKQKIASKPWNIIAHIVAYGLFTVFMLPLIFIVLYSFTDPV 
AIQTGNLTLSNFTLENYRLFFSNSAAFSPFLVSFIYSIIAATTATILAVVFARVVRKHKSRFDFLFEYGALLPWLLPSTLLA 
VSLLFTFNQPQFLVLNQILVGSLVILLIAYIVVKIPFSYRMVRAILFSVDDEMEDAARSMGASPFYTMMKVIIPFILPVVLS 
VIALNFNSLLTDFDLSVFLYHPLAQPLGITIRSAGDETATSNAQALVFVYTIVLMIISGTVLYFTQRPGRKVRKZ 
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Table 3 

ID201 > 4106.4 

5 ATGATAAAAAATCCTAAATTATTAACCAAGTCTTTTTTAAGAAGTTTTGCAATTCTAGGTGGTGTTGGTCTAGTCAT 
TCATATAGCTATTTATTTGACCTTTCCTTTTTATTATATTCAACTGGAGGGGGAAAAGTTTAATGAGAGCGCAAGAG 
TGTTTACGGAGTATTTAAAGACTAAGACATCTGATGAAATTCCAAGCTTACTCCAGTCTTATTCAAAGTCCTTGACC 
ATATCTGCTCACCTTAAAAGAGATATTGTAGATAAGCGGCTCCCTCTTGTGCATGACTTGGATATTAAAGATGGAAA 
GCTATCAAATTATATCGTGATGTTAGATATGTCTGTTAGTACAGCAGATGGTAAACAGGTAACCGTGCAATTTGTTC 

10 ACGGGGTGGATGTCTACAAAGAAGCAAAGAATATTTTGCTTTTGTATCTCCCATATACATTTTTGGTTACAATTGCT 
TTTTCCTTTGTTTTTTCTTATTTTTATACTAAACGCTTGCTCAATCCTCTTTTTTACATTTCAGAAGTGACTAGTAA 
AATGCAAGATTTGGATGACAATATTCGTTTTGATGAAAGTAGGAAAGATGAAGTTGGTGAAGTTGGAAAACAGATTA 
ATGGTATGTATGAGCACTTGTTGAAGGTTATTTATGAGTTGGAAAGTCGTAATGAGCAAATTGTAAAATTGCAAAAT 
CAAAAGGTTTCCTTTGTCCGCGGAGCATCACATGAGTTGAAAACCCCTTTAGCCAGTCTTAGAATTATCCTAGAGAA 

15 TATGCAGCATAATATTGGAGATTACAAAGATCATCCAAAATATATTGCAAAGAGTATAAATAAGATTGACCAGATGA 
GCCACTTATTAGAAGAAGTACTGGAGTCTTCTAAATTCCAAGAGTGGACAGAGTGTCGTGAGACCTTGACTGTTAAG 
CCAGTTTTAGTAGATATTTTATCACGTTATCAAGAATTAGCTCATTCAATAGGTGTTACAATTGAAAATCAATTGAC 
AGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAGGTTTTGACAAACCTGATTAGTAATGCAATTAAAT 
ATTCAGATAAAAATGGGCGTGTAATCATATCCGAGCAAGATGGCTATCTCTCTATCAAAAATACATGTGCGCCTCTA 

20 AGTGACCAAGAACTAGAACATTTATTTGATATATTCTATCATTCTCAAATCGTGACAGATAAGGATGAAAGTTCCGG 
TTTGGGTCTTTACATTGTGAATAATATTTTAGAAAGCTATCAAATGGATTATAGTTTTCTCCCTTATGAACACGGTA 
TGGAATTTAAGATTAGCTTGTAG 



25 MIKNPKLLTKSFLRSFAILGGVGLVIHIAIYLTFPFYYIQLEGEKFNESARVFTEYLKTKTSDEIPSLLQSYSKSLT 
ISAHLKRDIVDKi^LPLVHDLDIKDGKIjSNYIVMLDMSVSTADGKQVTVQFVHGVDVYKEAKNILLLYLPYT 
FSFVFSYFYTKRLLNPLFYISEVTSKMQDLDDNIRFDESRKDEVGEVGKQINGMYEHLLKVIYELESRNEQIVKLQN 
QKVSFVRGASHELKTPLASLRIILENMQHNIGDYKDHPKYIAKBINKIDQMSHLLEEVLESSKFQEWTECRETLTVK 
PVLVD I LSRYQELAHS I GVT I ENQLTDATRWMS LRALDKVLTNL I SNAI KY SDKNGRVI I S EQDGYLS I KNTCAPL 

30 SDQELEHLFDIFYHSQIVTDKDESSGLGLYIVNNILESYQMDYSFLPYEHGMEFKISLZ 

ID202 - 4106.9 

ATGGATAAAATTATTAAAACTATATCAGAAAGCGGAGCCTTTCGTGCTTTTGTCCTTGATAGCACTGAAACCGTCCG 
35 CACTGCTCAAGAAAAACATCAAACCCAAGCTAGCTCAACTGTAGCGCTTGGTCGAACTCTTATCGCTAGCCAGATTC 
TCGCAGCCAATGAAAAAGGAAATACCAAACTTACAGTTAAGGTGTTGGGATCTAGCTCTCTAGGTGCTATTATCACC 
GTCGCTGATACCAAGGGGAACGTCAAAGGCTATGTTCAAAATCCTGGTGTTGACATCAAAAAGACTGCGACTGGTGA 
AGTCCTAGTCGGACCTTTTGTTGGAAATGGTCAATTCCTCGTTATCACAGACTACGGTACTGGAAATCCTTACAACT 
CTATAACTCCCCTCATCTCTGGAGAAATCGGTGAAGACCTTGCCTTTTACCTTACTGAAAGCCAACAAACGCCTTCA 
40 GCGGTCGGCCTCAATGTCCTTTTGGACGAGGAAGACAAGGTCAAGGTTGCAGGTGGTTTCCTAGTTCAAGTCTTGCC 
AGGAGCCAAGAAAGAAGAGATTGCTCGCTTTGAAAAACGCATCCAAGAAATGCCAGCTATCTCTACTCTTGTCGAAA 
GCGACGACCATATCGAAGCCCTCCTCAAGGCTATCTACGGGGACGAAGCCTACAAGCGTCTTTCTGAAGAAGAAATC 
CGTTTCCAATGTGACTGTAGCCATGAACGCTTTATGAACGCTCTTGCCAGCCTTCCAAGCTCAGACTTACAGGAAAT 
GAAAGAGGAAGACCACGGGGCAGAAATCACTTGTCAATTCTGCCAAACTACTTACAACTTTGATGAAAAGGACCTGG 
45 AGGAACTCATTCGTGACAAATCTTAA 



I^KIIKTISESGAFRAFVLDSTETVRTAQEKHQTQASSTVALGRTLIASQILAANEKGNTKLTVICVLGSSSLGAIIT 
VADTKGNVKGYVQNPGVDIKKTATGEVLVGPFVGNGQFLVITDYGTGNPYNSITPLISGEIGEDLAFYLTESQQTPS 
50 AVGLNVLLDEEDKVKVAGGFLVQVLPGAKKEEIARFEKRIQEMPAISTLLESDDHIEALLKAIYGDEAYKRLSEEEI 
RFQCDCSHERFMNALASLPSSDLQEMKEEDHGAEITCQFCQTTYNFDEKDLEELIRDKSZ 



ID203 - 4115 

55 

ATGAAATCAATAACTAAAAAGATTAAAGCAACTCTTGCAGGAGTAGCTGCCTTGTTTGCAGTATTTGCTCCATCATT 
TGTATCTGCTCAAGAATCATCAACTTACACTGTTAAAGAAGGTGATACACTTTCAGAAATCGCTGAAACTCACAACA 
CAACAGTTGAAAAATTGGCAGAAAACAACCACATTGATAACATTCATTTGATTTATGTTGATCAAGAGTTGGTTATC 
GATGGCCCTGTAGCGCCTGTTGCAACACCAGCGCCAGCTACTTATGCGGCACCAGCCGCTCAAGATGAAACTGTTTC 



SUBSTITUTE SHEET (RULE 26) 



wo 00/06737 



PCT/GB99/02451 



89 



AGCTCCAGTAGCAGAAACTCCAGTAGTAAGTGAAACAGTTGTTTCAACTGTAAGCGGATCTGAAGCAGAAGCCAAAG 
AATGGATCGCTCAAAAAGAATCAGGTGGTAGTATACAGCTACAAATGGACGTTATATCGGACGTTACCAATTAA 



MKSITKKIKATLAGVAALFAVFAPSFVSAQESSTYTVKEGDTLSEIAETHNTTVEKLAENNHIDNIHLIYVDQELVI 
DGPVAPVATPAPATYAAPAAQDETVSAPVAETPWSETWSTVSGSEAEAKEWIAQKESGGSIQLQMDVISDVTNZ 

ID204 - 4117.1 

ATGAATTTAGGAGAATTTTGGTACAATAAAATAAATAAGAACAGAGGAAGAAGGTTAATGAAGAAAGTAAGATTTAT 
TTTTTTAGCTCTGCTATTTTTCTTAGCTAGTCCAGAGGGTGCAATGGCTAGTGATGGTACTTGGCAAGGAAAACAGT 
ATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGTTTTTGATACTCATTATCAATCTTGGTTCTATATAAAA 
GCAGATGCTAACTATGCTGAAAATGAATGGCTAAAGCAAGGTGACGACTATTTTTACCTCAAATCTGGTGGCTATAT 
GGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCTTGACCAAGATGGAAAGATGAAAAGAAATGCTT 
GGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGACTGGGTCTATGATTCTCAATACGATGCT 
TGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTCCAAATTAAAGGGAAGGACTATTATTTCAA 
ATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTATGTGAATGCTAGTGGTGCCAAAGTACAGCAAG 
GTTGGCTTTTTGACAAACAATACCAATCTTGGTTTTACATCAAAGAAAATGGAAACTATGCTGATAAAGAATGGATT 
TTCGAGAATGGTCACTATTATTATCTAAAATCCGGTGGCTACATGGCAGCCAATGAATGGATTTGGGATAAGGAATC 
TTGGTTTTATCTCAAATTTGATGGGAAAATGGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACT 
ACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATCTGAT 
GGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACAT 
GACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAAT 
GGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCTGGTGGCTACATGGCGAAAAATGAGACAGTAGAT 
GGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGAGGAAAAACTACAAATGAAAATGCTGCTTACTATCAAGTAGT 
GCCTGTTACAGCCAATGTTTATGATTCAGATGGTGAAAAGCTTTCCTATATATCGCAAGGTAGTGTCGTATGGCTAG 
ATAAGGATAGAAAAAGTGATGACAAGCGCTTGGCTATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGAT 
TTACAAGCGCTAGATGCTAGTAAGGACTTTATCCCTTATTATGAGAGTGATGGCCACCGTTTTTATCACTATGTGGC 
TCAGAATGCTAGTATCCCAGTAGCTTCTCATCTTTCTGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCC 
TGCATTTTGATGGTTTTAAGCTTGAGAATCCCTTCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGAA 
GAATTGGATAAGGTATTTAGTTTGCTAAACATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAAGGAAGC 
CGAAGAACATTACCATATCAATGCTCTTTATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAA 
TTGCCAAAGATAAGAATAATTTCTTTGGCATTACAGCCTATGATACGACCCCTTACCTTTCTGCTAAGACATTTGAT 
GATGTGGATAAGGGAATTTTAGGTGCAACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAGAACTTTCCTTGG 
AAACAAGGCTTCTGGTATGAATGTGGAATATGCTTCAGACCCTTATTGGGGCGAAAAAATTGCTAGTGTGATGATGA 
AAATCAATGAGAAGCTAGGTGGCAAAGATTAG 



I^LGEFWYNKINKNRGRRLMKKVRFIFIJ^LLFFLASPEGAMASDGTWQGKQYLKEDGSQAANEOTFDTHYQSWFY 
ADANYAENEWLKQGDDYFYLKSGGYMAKSEWEDKGAFYYLDQDGKMKRNAWGTSYVGATGAKVIEDWVYDSQYDA 
WF Y I KADGQHAE KEWLQ I KGKDY YFKSGGYLLTS QW I NQAYVNASGAKVQQGWLFDKQYQS WFYI KENGNYADKEW I 
FENGHYYYLKSGGYMAANEWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSD 
GKIAEKEWYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWYDSHSQAWYYFKSGGYMAKNETVD 
GYQLGS DGKWLGGKTTNENAAYYQWPVTANVYDSDGE KLS Y I SQGS VWLDKJDRKSDDKRLAI T I SGLSGYMKTED 
LQALDASKDFI PYYESDGHRFYHYVAQNAS I PVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAE 
ELDKVFSLLNINNSLLENKGATFKEAEEHYHINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYL 
DVDKGILGATKWIKENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKDZ 

ID205 - 4118,1 

ATGAAAAAATTAGGTACATTACTCGTTCTCTTTCTTTCTGCAATCATTCTTGTAGCATGTGCTAGCGGAAAAAAAGA 
TACAACTTCTGGTCAAAAACTAAAAGTTGTTGCTACAAACTCAATCATCGCTGATATTACTAAAAATATTGCTGGTG 
ACAAAATTGACCTTCATAGTATCGTTCCGATTGGGCAAGACCCACACGAATACGAACCACTTCCTGAAGACGTTAAG 
AAAACTTCTGAGGCTAATTTGATTTTCTATAACGGTATCAACCTTGAAACAGGTGGCAATGCTTGGTTTACAAAATT 
GGTAGAAAATGCCAAGAAAACTGAAAACAAAGACTACTTCGCAGTCAGCGACGGCGTTGATGTTATCTACCTTGAAG 
GTCAAAATGAAAAAGGAAAAGAAGACCCACACGCTTGGCTTAACCTTGAAAACGGTATTATTTTTGCTAAAAATATC 
GCCAAACAATTGAGCGCCAAA.GACCCTAACAATAAAGAATTCTATGAAAAAAATCTCAAAGAATATACTGATAAGTT 
AGACAAACTTGATAAAGAAAGTAAGGATAAATTTAATAAGATCCCTGCTGAAAAGAAACTCATTGTAACCAGCGAAG 
GAGCATTCAAATACTTCTCTAAAGCCTATGGTGTCCCAAGTGCTTACATCTGGGAAATCAATACTGAAGAAGAAGGA 
ACTCCTGAACAAATCAAGACCTTGGTTGAAAAACTTCGCCAAACAAAAGTTCCATCACTCTTTGTAGAATCAAGTGT 
GGATGACCGTCCAATGAAAACTGTTTCTCAAGACACAAACATCCCAATCTACGCTCAAATCTTTACTGACTCTATCG 



SUBSTITUTE SHEET (RULE 26) 



wo 00/06737 



PCT/GB99/02451 



90 



CAGAACAAGGTAAAGAAGGCGACAGCTACTACAGCATGATGAAATACAACCTTGACAAGATTGCTGAAGGATTGGCA 
AAATAA 



MKKLGTLLVLFLSAI I LVACASGKKDTTSGQKLKWATNS I IAD I TKNI AGDKI DLHS I VP IGQDPHE YE PLPEDVK 
KTSEANLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNI 
AKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEG 
TPEQIKTLVEKIiRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNLDKIAEGLA 
K2 

ID206 - 4119.1 

ATGGAATGGTATAAAAAAATCGGACTTCTTGCAACTACAGGTTTAGCTTTGTTTGGGCTCGGCGCTTGCTCCAACTA 
TGGTAAATCTGCGGATGGCACAGTGACCATCGAGTATTTCAACCAGAAAAAAGAAATGACCAAAACCTTGGAAGAAA 
TCACTCGTGATTTTGAGAAGGAAAACCCTAAGATCAAGGTCAAAGTCGTCAATGTACCAAATGCTGGTGAAGTATTG 
AAGACACGCGTTCTCGCAGGAGATGTGCCTGATGTGGTCAATATTTACCCACAGTCCATCGAACTGCAAGAATGGGC 
AAAAGCAGGTGTTTTTGAAGATTTGAGCAACAAAGACTACCTGAAACGCGTGAAAAATGGCTACGCTGAAAAATATG 
CTGTAAACGAAAAAGTTTACAACGTTCCTTTTACAGCTAATGCTTATGGAATTTACTACAACAAAGATA?VATTCGAA 
GAACTGGGCTTGAAGGTTCCTGAAACCTGGGATGAATTTGAACAGTTAGTCAAAGATATCGTTGCTAAAGGACAAAC 
ACCATTTGGAATTGCAGGTGCAGATGCTTGGACACTCAATGGTTACAATCAATTAGCCTTTGCGACAGCAACAGGTG 
GAGGAAAAGAAGCAAATCAATACCTTCGTTATTCTCAACCAAATGCCATTAAATTGTCGGATCCGATTATGAAAGAT 
GATATCAAGGTCATGGACATCCTTCGCATCAATGGATCTAAGCAAAAGAACTGGGAAGGTGCTGGCTATACCGATGT 
TATCGGAGCCTTCGCACGTGGGGATGTCCTCATGACACCAAATGGGTCTTGGGCGATCACAGCGATTAATGAACAAA 
AACCGAACTTTAAGATTGGGACCTTCATGATTCCAGGAAAAGAAAAAGGACAAAGCTTAACCGTTGGTGCGGGAGAC 
TTGGCATGGTCTATCTCAGCCACCACCAAACATCCAAAAGAAGCCAATGCCTTTGTGGAATATATGACCCGTCCAGA 
AGTCATGCAAAAATACTACGATGTGGACGGATCTCCAACAGCGATCGAAGGGGTCAAACAAGCAGGAGAAGATTCAC 
CGCTTGCTGGTATGACCGAATATGCCTTTACGGATCGTCACTTGGTCTGGTTGCAACAATACTGGACCAGTGAAGCA 
GACTTCCATACCTTGACCATGAACTATGTCTTGACCGGTGATAAACAAGGCATGGTCAATGATTTGAATGCCTTCTT 
TAACCCGATGAAAGCGGATGTGGATTAG 



MEWYKICIGLLATTGI^LFGLGACSNYGKSADGTVTIEYFNQKKEMTKTLEEITRDFEKENPKIKVKVVNVPNAGEVL 
KTRVLAGDVPDVVNIYPQSIELQEWAKAGVFEDLSNKDYLKRVKNGYAEKYAVNEKVYNVPFTANAYGIYYNKDKFE 

ELGLKVPETWDEFEQLVKDIVAKGQTPFGIAGADAWTLNGYNQLAFATATGGGKEANQYLRYSQPNAIKLSDPIMKD 
DIKVMDILRINGSKQKNWEGAGYTDVIGAFARGDVLMTPNGSWAITAINEQKPNFKIGTFMIPGKEKGQSLTVGAGD 
LAWSISATTKHPKEANAFVEYMTRPEVMQKYYDVDGSPTAIEGVKQAGEDSPIiAGMTEYAFTDRHLVWLQQYWTSEA 
DFHTLTMNYVLTGDKQGMVNDLNAFFNPMKADVDZ 

ID207 - 4123.1 

ATGAAGAAAATCAAACCGCATGGACCGTTACCAAGTCAGACTCAGCTAGCTTATCTGGGAGATGAACTAGCAGCTTT 
TATCCACTTCGGTCCTAATACCTTTTATGACCAAGAATGGGGGACTGGACAGGAGGATCCTGAGCGCTTTAACCCGA 
GTCAGTTGGATGCGCGTGAGTGGGTTCGTGTGCTCAAGGAAACGGGCTTCAAAAAGTTGATTTTGGTGGTCAAGCAC 
CACGATGGCTTTGTCCTTTATCCGACAGCTCACACAGATTATTCGGTTAAGGTCAGTCCTTGGAGGAGAGGAAAGGG 
CGACTTGCTCCTTGAAGTATCCCAAGCTGCCACAGAGTTTGATATGGATATGGGGGTCTACCTGTCACCGTGGGATG 
CCCATAGTCCCCTCTATCATGTGGACCGAGAAGCGGACTACAATGCCTATTATCTGGCTCAGTTGAAGGAAATCTTA 
TCAAATCCTAACTATGGGAATGCTGGTAAGTTCGCTGAGGTTTGGATGGATGGTGCCAGAGGAGAGGGCGCGCAAAA 
GGTTAATTATGAATTTGAAAAATGGTTTGAAACCATTCGTGACCTGCAGGGCGATTGCTTGATTTTTTCAACAGAAG 
GCACCAGTATCCGCTGGATTGGCAATGAACGAGGGTATGCAGGTGATCCACTGTGGCAAAAGGTGAATCCTGATAAA 
CTAGGAACAGAAGCAGAGCTGAACTATCTTCAGCACGGGGATCCCTCGGGCACGATTTTTTCAATCGGAGAGGCAGA 
TGTTTCCATCCGTCCAGGCTGGTTCTACCATGAGGATCAGGATCCTAAGTCTCTCGAGGAGTTGGTCGAAATCTACT 
TTCACTCAGTAGGGCGAGGAACTCCACTCTTGCTTAATATTCCGCCGAATCAAGCTGGGCTCTTTGATGCAAAGGAT 
ATTGAACGACTTTATGAATTTGCGACCTATCGCAATGAGCTCTATAAAGAAGATTTGGCTCTGGGAGCTGAGGTATC 
TGGTCCAGCTCTTTCCGCAGACTTTGCTTGTCGCCATTTGACAGACGGCCTTGAGACCAGCTCTTGGGCAAGCGATG 
CAGACTTGCCCATCCAGTTAGAACTCGACTTAGGTTCTCCTAAAACTTTTGATGTAATTGAGTTAAGAGAAGATTTG 
AAGCTAGGGCAACGAATCGCTGCTTTTCATGTGCAAGTAGAGGTGGATGGTGTCTGGCAGGAGTTTGGTTCGGGTCA 
TACTGTTGGTTACAAACGTCTCTTACGAGGAGCAGTTGTTGAGGCACAGAAGATACGTGTAGTCATTACAGAATCAC 
AGGCTTTGCCTTTGTTGACCAAGATTTCCCTTTATAAAACTCCTGGATTATCAAAAAAAGAAGTTGTTCAGGAACTA 
GCATTTGCAGAAAAAAGCCTAGCTGTGGCAAAGGGAGAAAATGCCTATTTTACAGTTAAGCGCAGAGAATGTAGTGG 
TCCTTTAGAAGCTAAGATTTCGATTCAACCGGGGACAGGTGTCCATGGTGTCGCCTATCAGGATGAGATTCAAGTCC 
TTGCGTTTCAAAGTGGTGAGACTGAAAAAAGTCTGACGCTACCAACCTTGTATTTCGCAGGAGATAAAACCTTGGAT 
TTCTATCTGAACCTAACGGTGGATGGTCAGCTTGTGGATCAACTTCAAGTCCAAGTTTCATAA 
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MKKIKPHGPLPSQTQIiAYLGDELAAFIHFGPNTFYDQEWGTGQEDPERFNPSQLDAREWVRVLKETGFKKLILWKH 

HDGFVL YPTAHTD YSVKVS PWRRGKGDLLLE VSQAATE FDMDMGVYLS PWDAHS PL YHVDREAD YNAYYLAQLKE I L 
SNPNYGNAGKFAEWMDGARGEGAQKVNYEFEKWFETIRDLQGDCLIFSTEGTSIRWIGNERGYAGDPLWQKVNPDK 
5 LGTEAELNYLQHGDPSGTIFSIGEADVSIRPGWFYHEDQDPKSLEELVEIYFHSVGRGTPLLLNIPPNQAGLFDAKD 
lERLYEFATYRNELYKEDLALGAEVSGPALSADFACRHLTDGLETSSWASDADLPIQLELDLGSPKTFDVIELREDL 
KLGQRIAAFHVQVEVDGVWQEFGSGHTVGYKRLLRGAWEAQKIRWITESQALPLLTKISLYKTPGLSKKEWQEL 
AFAEKS LAVAKGENAYFTVKRRECSGPLEAKI S I Q PGTGVHGVAYQDE I QVLAFQTGETEKS LTLPTLYFAGDKTLD 
F YLNLTVDGQLVDQLQVQVS Z 

10 

ID208 - 4125,12 

ATGCTTGAAAGACTGAAAAGAATACATTATATGTTTTGGATCAGTTTAATTTTTATGATTTTCCCCATCCTGTCTGT 
AGTGACTGGGTGGCTTTCTGCCTGGCATTTATTGATTGATATTCTATTTGTAGTGGCATATTTGGGTGTTTTAACAA 
CTAAGAGCCAGCGCCTATCTTGGCTATATTGGGGCCTCATGCTGACTTATGTAGTTGGGAATACTGCCTTTGTTGCT 

15 GTTAATTATATCTGGTTTTTCTTTTTCCTATCCAATCTCTTAAGTTATCATTTCAGCGTACGTAGTTTAAAGTCTTT 
ACATGTCTGGACTTTTCTTCTTGCTCAAGTCCTTGTTGTGGGGCAACTGTTGATTTTTCAGAGAATCGAAGTTGAGT 
TTCTATTCTATCTACTTGTAATTCTTACTTTTGTCGATTTAATGACTTTTGGATTGGTTCGGATTCGTATTGTCGAG 
GATTTGAAAGAAGCTCAGGTCAAGCAAAATGCTCAGATAAATCTATTGCTTGCTGAAAATGAACGTAGTCGTATCGG 
TCAGGATTTGCATGATAGTCTGGGACATACCTTTGCTATGCTGAGTGTCAAGACAGATTTAGCCTTGCAGTTATTTC 

20 AGATGGAGGCTTATCCACAGGTGGAAAAGGAATTAAAAGAAATTCACCAGATCAGCAAGGATCCATGA 



MLERLKRIHYMFWISLIFMIFPILSWTGWLSAWHLLIDILFWAYLGVLTTKSQRLSWLYWGLMLTYWGNTAFVA 
VNYIWFFFFLSNLLSYHFSVRSLKSLHVWTFLLAQVLWGQLLIFQRIEVEFLFYLLVILTFVDLMTFGLVRIRIVE 
25 DLKEAQVKQNAQINLLIJ^ENERSRIGQDLHDSLGHTFAMLSVKTDLALQLFQMEAYPQVEKELKEIHQISKDPZ 

ID209 - 4126.3 

ATGAATGATAAGTTAAAAATCTTCTTGTTGCTAGGAGTATTTTTTCTAGCCATAACCGGTTTCTATGTTCTATTGAT 
ACGAAATGCAGGGCAGACAGATGCCTCGCAAATTGAAAAGGCGGCAGTTAGCCAAGGAGGAAAAGCAGTGAAAAAAA 

30 CAGAAATTAGTAAAGACGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGGAGTGGAGGAATATTTC 
TCACGTGTTCCCGGGGTGACGGATGCCGTTTCAGGCTATGCAAATGGTAGAGGAGAAACAACCAAGTACGAATTGAT 
TAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGCCAAGCAAATTTCTCTCAAGGAAATCCTGCTTC 
ACTATTTCCGCATTATCAATCC^CCAGCAAAAATAAACAAGGAAATGATGTGGGGACCCAGTACCGTACTGGTGTT 
TATTACACAGATGACAAGGATTTGGAAGTGATTAACCAAGTCTTTGATGAGGTGGCTAAGAAATACGATCAACCTCT 

35 AGCAGTTGAAAAGGAAAACTTGAAGAATTTTGTGGTGGCTGAGGATTACCATCAAGACTATCTCAAGAAAAATCCAA 
ATGGCTACTGCCATATCAATGTTAATCAGGCGGCCTATCCTGTCATTGATGCCAGCAAATATCCAAAACCAAGTGAT 
GAGGAATTGAAAAAGACCCTGTCACCTGAGGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCTCAAA 
CCGTTACTGGGATAAATTTGAATCCGGTATCTATGTGGATATAGCAACTGGGGAACCTCTCTTTTCATCAAAAGACA 
AATTTGAGTCTGGTTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGTCACCTACAAGGAAGATAAG 

40 TCCTACAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCACCTTGGGCATGTCTTTACGGATGGTCC 
ACAGGACAAGGGCGGCTTACGTTACTGTATCAATAGCCTCTCTATCCGCTTTATTCCCAAAGACCAAATGGAAGAAA 

AAGgcTACGCTTATTTACTAGATTATGTTGATTAA 

45 MNDKIiKI FLLLGVFFLAI TGF YVLL I RNAGQTDASQ I E KAAVSQGGKAVKKTE I S KDADLHE I YIoAGGCFWGVEE YF 

SRVPGVTDAVSGYANGRGETTKYELINQTGHAETVHVTYDAKQISLKEILLHYFRIINPTSICNKQGNDVGTQYRTGV 

YYTDDKDLEVINQVFDEVAKKYDQPLAVEKENLKNFVVAEDYHQDYLKKNPNGYCHINVNQAAYPVIDASKYPKPSD 

EELKKTLSPEEYAVTQENQTERAFSNRYWDKFESGIYVDIATGEPLFSSKDKFESGCGWPSFTQPISPDWTYKEDK 

SYNMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLS IRFI PKDQMEEKGYAYLLDYVDZ 

50 

ID210 - 4127.1 

ATGAAAAAGAAATGGATGTATTATGCTGCTTGTTCTTCTAATGAATCTGCCGATGACAGTTCATCTGATAAAGGAGA 
CGGCGGTTCGCTAGTCGTTTATTCACCAAACTCAGAGGGCTTAATTGGAGCAACTATTCCTGCCTTTGAAGAAAAAT 
ATGGTATCAAAGTAGAACTGATTCAAGCTGGTACTGGAGAACTTTTCAAAAAACTAGAGTCAGAAAAAGAAGTTCCT 

55 GTAGCTGATGTTATCTTTGGTGGTTCTTATACACAATATACTACCCACGGAGAACTCTTTGAAAACTATACTTCAAA 
AGAAAATGATAATGTTATCAAAGAATATCAAAACACAACTGGCTACTCTACTCCTTATACACTAGATGGTAGTGTTT 
TAATCGTCAACCCTGATTTAACTAAAGGCATGAACATCGAAGGATATAACGATCTTTTCAAACCTGAACTAAAAGGA 
AAAATCGCAACTGCTGACCCAGCAAACTCTTCTAGCGCCTTTGCTCAATTAACAAATATGCTACAAGCTCAAGGTGG 
TTACAAAGATGATAAGGCTTGGTCTTATGTAAAAGATCTTTTCACACTTATTGATGGTAAAATCGGTTCAAGTTCAT 

60 CTAGTGTCTATAAAGTAGTCGCTGATGGAGAAATGGCTGTTGGTCTCTCTTATGAAGATCCAGCAGTTAAACTCTTA 
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AATGACGGAGCTAACATTAAGGTAGTCTATCCAAAAGAAGGAACCGTCTTCCTACCTGCTAGTGCTGCTATCGTTAA 
AAAATCTAAAAATATGGAAAATGCCAAGAAATTTATCGATTTTATTATCTCTCAAGAAGTACAAGATACACTTGGTA 
CAACCACTACTAACCGTCCTGTTCGTAAAAATGCTAAAACAAGCGAAAACATGAAACCAATTGACAAAATCAAAACA 
CTCACTGAAGATTATGATTATGTCATCAAGAATAAATCAGATATCGTTAAGAAATACAACGAAGTCTTTACAGATAT 
CCAATCTAAACAGTAA 



MKKKWMYYAACSSNESADDSSSDKGDGGSLWYSPNSEGLIGATIPAFEEKYGIKVELIQAGTGELFKKLESEKEVP 
VADVIFGGSYTQYTTHGELFENYTSKENDNVIKEYQNTTGYSTPYTLDGSVLIVNPDLTKGMNIEGYNDLFKPELKG 
KI ATADPANS S S AFAQLTNMLQAQGG YKDDKAWS YVKDLFTL I DGKI GSS S S S VYKWADGEMAVGLS YED P AVKLL 
lOTGANIK^YPKEGTVFLPASAAIVKKSKNMENAKKFIDFIISQEVQDTLGTTTTNRPVRKNAKTSENMKPIDKIKT 
LiTEDYDYVIKNKSDIVKKYNEVFTDIQSKQZ 

ID211 ■ 4127.2 

ATGAGTGAGATCAAAATTATTAACGCCAAAAAAATCTACCACGATGTCCCTGTTATTGAGAATTTGAACATTACAAT 
TCCAAAAGGAAGTCTCTTTACCCTTCTTGGAGCTTCAGGATGTGGGAAAACGACCCTTCTTCGTATGATTGCAGGTT 
TCAACAGTATCGAAGGTGGAGAATTTTACTTCGATGATACAAAAATCAATAATATGGAACCCAGCAAACGCAATATC 
GGGATGGTTTTCCAAAACTACGCTATTTTCCCACATTTGACTGTCCGAGACAACGTTGCTTTTGGTCTTATGCAAAA 
GAAGGTTCCAAAAGAAGAATTGATTCAACAGACCAACAAGTATCTTGAACTCATGCAAATTGCTCAATATGCGGATC 
GAAAGCCCGATAAACTCAGTGGTGGACAACAACAACGTGTCACCTTGGCATGCGCCTTAGCGGTTAATCCAAGTGTT 
CTCCTCATGGACGAGCCACTTAGTAATCTGGAGGCCAAACTTCGCTTGGATATGCGTCAAGCCATCCGAGAAATCCA 
ACACGAAGTGGGAATTACAACTGTTTATGTAACCCACGACCAAGAAGAAGCCATGGCTATTTCAGACCAAATTGCTG 
TTATGAAAGATGGGGTGATCCAACAAATCGGCCGACCAAAAGAACTCTATCATAAACCAGCTAATGAGTTTGTGGCA 
ACCTTTATCGGACGCACAAATATTATCCCTGCCAATCTTGAAAAACGGAGCGACGGCGCTTATATCGTCTTTTCAGA 
TGGCTATGCCCTTCGAATGCCAGCTCTTGATCAGGTTGAGGAGCAAGCTATTCATGTAAGCATTCGTCCCGAAGAGT 
TTATCAAAGATGAATCTGGAGATATTGAAGGAACTATTAGAGATAGCGTCTATCTTGGACTAAATACGGATTATTTC 
ATTGAGACAGGTTTTGCCTCAAAAATTCAAGTTAGTGAAGAATCAACTTTTGAAGAAGATCTACAAAAAGGCAATCG 
TATTCGTCTACGAATCAATACGCAAAAATTAAACATCTTTTCTGCAGATGGTTCCCAAAACCTGATAAAAGGAGTCA 
ACCATGGAACGTAA 

MSEIKIINAKKIYHDVPVIENLNITIPKGSLFTLLGASGCGKTTLLRMIAGFNSIEGGEFYFDDTKINNMEPSKRNI 

GlWFQNYAIFPHLTVRDOTAFGLMQKKVPraELIQQTNKYLELMQIAQYADRKPDKLSGGQQQRVTIxA.CA]^ 

LLl^EPLSNLEAKLRLDMRQAIREIQHEVGITTVYVTHDQEEAMAISDQIAVMKDGVIQQIGRPKELYHKPANEFVA 

TFIGRTNIIPANLEKRSDGAYIVFSDGYALRMPALDQVEEQAIHVSIRPEEFIKDESGDIEGTIRDSVYLGLNTDYF 

lETGFASKIQVSEESTFEEDLQKGNRIRLRINTQKLNIFSADGSQNLIKGVNHGTZ 

ID212 - 4136.1 

ATGAAGAAAAAATTATTGGCAGGTGCCATCACACTATTATCAGTAGCAACTTTAGCAGCTTGTTCGAAAGGGTCAGA 
AGGTGCAGACCTTATCAGCATGAAAGGGGATGTCATTACAGAACATCAATTTTATGAGCAAGTGAAAAGCAACCCTT 
CAGCCCAACAAGTCTTGTTAAATATGACCATCCAAAAAGTTTTTGAAAAACAATATGGCTCAGAGCTTGATGATAAA 
GAGGTTGATGATACTATTGCCGAAGAAAAAAAACAATATGGCGAAAACTACCAACGTGTCTTGTCACAAGCAGGTAT 
GACTCTTGAAACACGTAAAGCTCAAATTCGTACAAGTAAATTAGTTGAGTTGGCAGTTAAGAAGGTAGCAGAAGCTG 
AATTGACAGATGAAGCCTATAAGAAAGCCTTTGATGAGTACACTCCAGATGTAACGGCTCAAATCATCCGTCTTAAT 
AATGAAGATAAGGCCAAAGAAGTTCTCGAAAAAGCCAAGGCAGAAGGTGCTGATTTTGCTCAATTAGCCAAAGATAA 
TTCAACTGATGAAAAAACAAAAGAAAATGGTGGAGAAATTACCTTTGATTCTGCTTCAACAGAAGTACCTGAGCAAG 
TCAAAAAAGCCGCTTTCGCTTTAGATGTGGATGGTGTTTCTGATGTGATTACAGCAACTGGCACACAAGCCTACAGT 
AGCCAATATTACATTGTAAAACTCACTAAGAAAACAGAAAAATCATCTAATATTGATGACTACAAAGAAAAATTAAA 
AACTGTTATCTTGACTCAAAAACAAAATGATTCAACATTTGTTCAAAGCATTATCGGAAAAGAATTGCAAGCAGCCA 
ATATCAAGGTTAAGGACCAAGCCTTCCAAAATATCTTTACCCAATATATCGGTGGTGGAGATTCAAGCTCAAGCAGT 
AGTACATCAAACGAATAG 

MKKKLLAGAITLLSVATtAACSKGSEGADLISMKGDVITEHQFYEQVKSNPSAQQVLLNMTIQKVFEKQYGSELDDK 
EVDDTIAEEKKQYGENYQRVLSQAGMTLETRKAQIRTSKLVELAVKKVAEAELTDEAYKKAFDEYTPDVTAQIIRLN 
NEDKAKEVLEKAKAEGADFAQLAKDNSTDEKTKENGGE I TFDS ASTEVPEQVKKAAFALDVDGVSDVI TATGTQAYS 

SQYYIVKLTKKTEK^SNIDDYKEKLKTVILTQKQNDSTFVQSIIGKELQAANIKVKDQAFQNIFTQYIGGGDSSSSS 
STSNEZ 
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ID213 - 4137.3 

ATGAAAAAAAATATTAAACAATATGTAACCTTAGGTACTGTAGTGGTATTATCAGCATTTGTTGCTAACTCAGTTGC 
AGCTCAGGAGACTGAAACTTCTGAAGTATCAACACCAAAGTTGGTGCAACCTGTTGCACCAACGACTCCGATTTCGG 
AAGTACAACCTACATCGGATAACTCTTCGGAAGTTACTGTACAACCTCGAACAGTTGAAACTACTGTTAAGGATCCA 
TCTTCTACAGCGGAAGAAACTCCTGTCTTAGAAAAAAATAATGTTACTTTAACAGGGGGCGGAGAAAATGTTACTAA 
AGAGTTAAAGGATAAATTTACTAGCGGTGACTTTACTGTAGTGATTAAGTACAATCAGTCAAGTGAGAAAGGCTTAC 
AAGCTCTGTTTGGAATATCTAATTCCAAACCCGGTCAACAAAATAGTTATGTAGATGTGTTCCTTAGAGACAATGGT 
GAGTTGGGGATGGAAGCGCGTGATACTTCTTCCAATAAAAATAACCTAGTATCCAGACCTGCTTCAGTTTGGGGTAA 
GTACAAACAAGAGGCTGTGACTAACACTGTTGCAGTAGTAGCAGATTCAGTCAAAAAAACATATTCTTTATACGCAA 
ATGGTACAAAAGTAGTAGAAAAGAAAGTGGATAATTTCCTAAACATCAAGGATATTAAAGGTATTGATTACTATATG 
CTTGGGGGAGTGAAACGTGCAGGAAAAACGGCGTTTGGTTTTAACGGAACACTAGAAAATATCAAATTCTTTAATAG 
TGCATTGGATGAAGAAACTGTTAAAAAGATGACAACAAACGCTGTTACTGGACATTTAATTTATACGGCTAATGATA 
CAACAGGTTCTAACTATTTCCGTATTCCAGTTCTGTATACTTTTAGCAATGGTCGGGTATTTTCAAGCATTGACGCT 
CGTTACGGTGGAACTCATGATTTCTTGAATAAAATTAATATTGCTACAAGTTATAGTGATGATAATGGTAAGACATG 
GACTAAACCAAAATTAACATTGGCATTCGATGATTTTGCGCCAGTACCATTAGAATGGCCTCGTGAAGTTGGTGGAC 
GTGACTTACAAATCAGCGGTGGTGCAACCTATATTGACTCTGTTATTGTTGAAAAAAAGAACAAACAAGTACTCATG 
TTTGCTGATGTGATGCCTGCTGGAGTAAGTTTTAGAGAAGCAACTAGAAAAGATTCAGGTTATAAACAAATTGATGG 
TAATTATTACCTTAAATTAAGGAAACAAGGTGATACTGATTACAATTATACTATTCGTGAGAATGGTACTGTATACG 
ACGATCGTACCAACAGACCAACTGAATTTTCAGTAGATAAAAATTTCGGTATTAAACAAAATGGTAATTATTTGACG 
GTAGAGCGG 

MKKNIKQYVTLGTVWLSAFVANSVAAQETETSEVSTPKLVQPVAPTTPISEVQPTSDNSSEVTVQPRWETTVKDP 

SSTAEETPVLEKNNVTLTGGGE^^/TKELKDKFTSGDFTVVIKYNQSSEKGLQALFGISNSKPGQQNSYVDVFLRDNG 

ELGMEARDTSSNK^rNLVSRPASWGKYKQEAVTNWAWADSVKKTYSLYANGTKWEKKVDNFIJNIKDIK^ 

LGGVKRAGKTAFGFNGTLENIKFFNSALDEETVKKMTTNAVTGHLIYTANDTTGSNYFRIPVLYTFSNGRVFSSIDA 

RYGGTHDFLNKINIATSYSDDNGKTWTKPKLTIiAFDDFAPVPLEWPREVGGRDLQISGGATYIDSVIVEKKNKQVLM 

FADVMPAGVSFREATRKDSGYKQIDGNYYLKLRKQGDTDYNYTIRENGTVYDDRTNRPTEFSVDKNFGIKQNGNYLT 

VER 

ID214 - 4185 

ATGAAAAAATTTAGCCTATTACTAGCTATCCTACCATTTTTGGTTGCCTGTGAGAATCAAGCTACACCCAAAGAGAC 
TAGCGCTCAAAAGACAATCGTCCTTGCTACAGCTGGCGACGTGCCACCATTTGACTACGAAGACAAGGGCAATCTGA 
CAGGCTTTGATATCGAAGTTTTAAAGGCAGTAGATGAAAAACTCAGCGACTACGAGATTCAATTCCAAAGAACCGCC 
TGGGAGAGCATCTTCCCAGGACTTGATTCTGGTCACTATCAGGCTGCGGCCAATAACTTGAGTTACACAAAAGAGCG 
TGCTGAAAAATACCTTTACTCGCTTCCAATTTCCAACAATCCCCTCGTCCTTGTCAGCAACAAGAAAAATCCTTTGA 
CTTCTCTTGACCAGATCGCTGGTAAAACAACACAAGAGGATACCGGAACTTCTAACGCTCAATTCATCAATAACTGG 
AATCAGAAACACACTGATAATCCCGCTACAATTAATTTTTCTGGTGAGGATATTGGTAAACGAATCCTAGACCTTGC 
TAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCAAAAGATTATCAAGGACCGTGGTTTAGACCTCT 
CAGTCGTTGATTTACCTTCTGCAGATAGCCCCAGCAATTATATCATTTTCTCAAGCGACCAAAAAGAGTTTAAAGAG 
CAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCTTGAAAAACTCAGCAATACCTATCTAGGTGGTTC 
TTACCTCCCAGATCAATCTCAGTTACAATAA 



MKKFSLLLAILPFLVACENQATPKETSAQKTIVLATAGDVPPFDYEDKGNLTGFDIEVLKAVDEKLSDYEIQFQRTA 
WESIFPGIiDSGHYQAAANNLSYTKERAEKYLYSLPISNNPLVLVSNKKNPLTSLDQIAGKTTQEDTGTSNAQFINNW 
NQKHTDNPATINFSGEDIGKRILDLANGEFDFLVFDKVSVQKIIKDRGLDLSWDLPSADSPSNYIIFSSDQKEFKE 
QFDKALKELYQDGTLEKLSNTYLGGSYLPDQSQLQZ 

ID215 - 4211.1 

ATGAAAAAAAATAGTTTATATATCATATCCTCACTCTTTTTTGCTTGTGTCTTATTTGTCTATGCTACGGCGACGAA 
TTTTCAAAACAGTACCAGTGCTAGGCAGGTAAAAACGGAAACCTATACTAATACAGTAACAAATGTCCCTATTGACA 
TACGCTATAATAGTGATAAGTATTTTATTAGCGGTTTTGCTTCAGAAGTATCAGTGGTCTTGACTGGTGCAAATCGC 
CTATCGCTAGCTAGTGAAATGCAAGAAAGTACACGTAAATTCAAGGTTACTGCTGACCTAACAGATGCCGGTGTTGG 
AACGATTGAAGTTCCTTTGAGCATTGAAGATTTACCCAATGGGCTGACCGCTGTGGCGACTCCGCAAAAAATTACAG 
TCAAGATTGGTAAGAAGGCTCAGAAGGATAAGGTAAAGATTGTACCAGAGATTGACCCTAGTCAAATTGATAGTCGG 
GTACAAATTGAAAATGTCATGGTGTCAGATAAAGAAGTGTCTATTACGAGTGACCAAGAGACATTGGATAGAATTGA 
TAAGATTATCGCTGTTTTGCCAACTAGCGAACGTATAACAGGTAATTACAGTGGTTCAGTACCTTTGCAGGCAATCG 
ACCGCAATGGTGTTGTCTTACCGGCAGTTATCACTCCGTTTGATACAATAATGAAGGTGACTACAAAACCAGTAGCA 
CCAAGTTCAAGCACATCAAATTCAAGTACAAGCAGTTCATCGGAGACATCTTCGTCAACGAAAGCAACTAGTTCAAA 
AACGAATTAA 
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MKKNSLYIISSLFFACVLFVYATATNFQNSTSARQVKTETYTNTVTNVPIDIRYNSDKYFISGFASEVSVVLTGANR 
LSLASEMQESTRKFKVTADLTDAGVGTIEVPLSIEDLPNGLTAVATPQKITVKIGKKAQKDBCVKIVPEIDPSQIDSR 
VQIENVMVSDKEVSITSDQETLDRIDKIIAVLPTSERITGNYSGSVPLQAIDRNGWLPAVITPFDTIMKVTTKPVA 

5 PSSSTSNSSTSSSSETSSSTKATSSKTNZ 
ID216 - 4127.3 

ATGTTGATTGGCGAAGGGTATCGGACTTTCCCTGTCCTGATTTATACCCT^TTTATTAGCGAGGTTGGAGGAAATTC 
TGCTTTTGCAATTATGGCGATTATCATTGCCTTGGCAATTTTCCTTATCCAAAAACACATTGCAAACCGCTACAGTT 

10 TCAGCATGAATCTGCTCCATCCJ^ATTGAGCCTAAAAAAACTACAAAAGGAAAAATGGCTGCCATTTATGCAACAGTC 
TACGGAATTATCTTTATCTCTGTTTTACCTCAAATCTACTTAATTTATACCTCTTTCCTAAAAACATCAGGTATGGT 
ATCTGTTAAAGGTTATTCTCCAAACAGTTACAAGGTAGCTTTCCATCGTATGGGATCTGCTATTTTCAATACCATTC 
GTATCCCTTTGATTGCCTTAGTTCTAGTTGTTCTATTTGCGACATTTATCTCCTACCTAGCCGTTAGAAAACGGAAT 
TTGTTTACAAACTTAATTGACAGCCTCAGTATGGTACCTTATATTGTACCAGGAACCGTTCTAGGGATTGCCTTCAT 

15 TTCTTCCTTCAATACTGGTCTATTTGGAAGTGGATTTCTTATGATTACAGGGACTGCTTTCATCTTGATTATGTCTC 
TATCTGCCAGAAGATTACCTTATACTATTCGCTCATCTGTTGCTAGCTTACAACAAATAGCACCAAGTATTGAAGAA 
GCTGCTGAAAGCTTAGGAAGTAGTCGTCTCAATACCTTTGCTAAGATTACAACTCCAATGATGCTATCTGGTATCAT 

TTCTGGAGCCATCTTATCTTGA 

20 

MLIGEGYRTFPVLIYTQFISEVGGNSAFAIMAIIIALAIFLIQKHIANRYSFSMNLLHPIEPKKTTKGKMAAIYATV 
YGIIFISVLPQIYLIYTSFLKTSGMVSVKGYSPNSYKVAFHRMGSAIFNTIRIPLIALVLWLFATFISYLAVRKRN 
LFTNLIDSLSMVPYIVPGTVLGIAFISSFNTGLFGSGFLMITGTAFILIMSLSARRLPYTIRSSVASLQQIAPSIEE 

AAESLGSSRLNTFAKITTPMMLSGI I SGAI LS Z 

25 
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Table 4 

ID3 01 

ATGAATAAGAAAAAAATGATTTTAACAAGTCTAGCCAGCGTCGCTATCTTAGGGGCTGGTTTTGTTACGTCTCAGCC 
TACTTTTGTAAGAGCAGAAGAATCTCCACAAGTTGTCGAAAAATCTTCATTAGAGAAGAAATATGAGGAAGCAAAAG 
CAAAAGCTGATACTGCCAAGAAAGATTACGAAACGGCTAAAAAGAAAGCAGAAGACGCTCAGAAAAAGTATGAAGAT 
GATCAGAAGAGAACTGAGGAGAAAGCTCGAAAAGAAGCAGAAGCATCTCAAAAATTGAATGATGTGGCGCTTGTTGT 
TCAAAATGCATATAAAGAGTACCGAGAAGTTCAAAATCAACGTAGTAAATATAAATCTGACGCTGAATATCAGAAAA 
AATTAACAGAGGTCGACTCTAAAATAGAGAAGGCTAGGAAAGAGCAACAGGACTTGCAAAATAAATTTAATGAAGTA 
AGAGCAGTTGTAGTTCCTGAACCAAATGCGTTGGCTGAGACTAAGAAAAAAGCAGAAGAAGCTAAAGCAGAAGAAAA 
AGTAGCTAAGAGAAAATATGATTATGCAACTCTAAAGGTAGCACTAGCGAAGAAAGAAGTAGAGGCTAAGGAACTTG 
AAATTGAAAAACTTCAATATGAAATTTCTACTTTGGAACAAGAAGTTGCTACTGCTCAACATCAAGTAGATAATTTG 
AAAAAACTTCTTGCTGGTGCGGATCCTGATGATGGCACAGAAGTTATAGAAGCTAAATTAAAAAAAGGAGAAGCTGA 
GCTAAACGCTAAACAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACTTCTTGACAGCCTTGATCCTGAAG 
GTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGAGTTGGATAAAAAAGCTGATGAACTTCAAAATAAA 
GTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTACTTGGAGGGGCTGATCCTGAAGATGATACTGCTGC 
TCTTCAAAATAAATTAGCTGCTAAAAAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACTTCTTGACAGCC 
TTGATCCTGAAGGTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGAGTTGGATAAAAAAGCTGATGAA 
CTTCAAAATAAAGTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTACTTGGAGGGGCTGATTCTGAAGA 
TGATACTGCTGCTCTTCAAAATAAATTAGCTACTAAAAAAGCTGAATTGGAAAAAACTCAAAAAGAATTAGATGCAG 
CTCTTAATGAGTTAGGCCCTGATGGAGATGAAGAAGAAACTCCAGCGCCGGCTCCTCAACCAGAGCAACCAGCTCCT 
GCACCAAAACCAGAGCAACCAGCTCCAGCTCCAAAACCAGAGCAACCAGCTCCTGCACCAAAACCAGAGCAACCAGC 
TCCAGCTCCAAAACCAGAGCAACCAGCTCCAGCTCCAAAACCAGAGCAACCAGCTAAGCCGGAGAAACCAGCTGAAG 
AGCCTACTCAACCAGAAAAACCAGCCACTCCAAAAACAGGCTGGAAACAAGAAAACGGTATGTGGTATTTCTACAAT 
ACTGATGGTTCAATGGCAATAGGTTGGCTCCAAAACAACGGTTCATGGTACTACCTAAACGCTAACGGCGCTATGGC 
AACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCA 
AAGTATCAGATAAATGGTACTATGTCAACAGCAATGGCGCTATGGCGACAGGCTGGCTCCAATACAATGGCTCATGG 
TACTACCTCAACGCTAATGGTGATATGGCGACAGGATGGCTCCAATACAACGGTTCATGGTATTACCTCAACGCTAA 
TGGTGATATGGCGACAGGATGGGCTAAAGTCAACGGTTCATGGTACTACCTAAACGCTAACGGTGCTATGGCTACAG 
GTTGGGCTAAAGTCAACGGTTCATGGTACTACCTAAACGCTAACGGTTCAATGGCAACAGGTTGGGTGAAAGATGGA 
GATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTA 
TGTCAATGGCTTAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATAAAGTCAATGCCAATGGTGAATGGGTTT 
AA 

MNKKKMI LTS LAS VAI LGAGFVTSQ PTFVRAEES PQ WEKS S LE KKYEE AKAKADTAKKDYETAKKKAEDAQKKYED 

DQKRTEEKARKEAEASQKliNDVALWQNAYKEYREVQNQRSKYKSDAEYQKKLTEVDSKIEKARKEQQDLQNK™ 

RAVWPEPNALAETKKKAEEAKAEEKVAKRKYDYATLKVALAKKEVEAK^LEIEKIjQYEISTLEQE 

KiOjI^GADPDDGTEVIEAKLKKGEAELNAKQAELAKKQTELEKLLDSLDPEGKTQDELDKEAEEAELDKKADELQNK 

VADLEKE I SNLE I LLGGADPEDDTAALQNKLAAKKAELAKKQTELEKLLDS LD PEGKTQDELDKEAEEAELDKKADE 

LQNKVADLEKEISNLEILLGGADSEDDTAALQNKLATKKAELEKTQKELDAALNELGPDGDEEETPAPAPQPEQPAP 

APKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAKPEKPAEEPTQPEKPATPKTGWKQENGMWYFYN 

TDGSMAIGWLQNNGSWYYLNANGA>1^TGWKDGDTWYYLEASGAMKASQWFKVSDK>7YYWSNGAMATGWLQ^ 

YYLNANGD^lATGWLQYNGSWYYLNANGD^^TGWAKVNGSWYYLNANGAMATGWAKVNGSWYYIiNANGS^^ 

DTWYYLEASGAMKASQWFKVSDKWYYWGLGAIiAWTTVDGYKVNANGEWVZ 

ID302 

ATGTTTGCATCAAAAAGCGAAAGAAAAGTACATTATTCAATTCGTAAATTTAGTGTTGGAGTAGCTAGTGTAGTTGT 
TGCCAGTCTTGTTATGGGAAGTGTGGTTCATGCGACAGAGAACGAGGGAGCTACCCAAGTACCCACTTCTTCTAATA 
GGGCAAATGAAAGTCAGGCAGAACAAGGAGAACAACCTAAAAAACTCGATTCAGAACGAGATAAGGCAAGGAAAGAG 
GTCGAGGAATATGTAAAAAAAATAGTGGGTGAGAGCTATGCAAAATCAACTAAAAAGCGACATACAATTACTGTAGC 
TCTAGTTAACGAGTTGAACAACATTAAGAACGAGTATTTGAATAAAATAGTTGAATCAACCTCAGAAAGCCAACTAC 
AGATACTGATGATGGAGAGTCGATCAAAAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGACTCATCTTCTTCGTCA 
AGTTCAGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAACAAGCCGACAGAACCAGGAGA/WIGGT 
AGCAGAAGCTAAGAAGAAGGTTGAAGAAGCTGAGAAAAAAGCCAAGGATCAAAAAGAAGAAGATCGTCGTAACTACC 
CAACCATTACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAAA73U=J^GCGGAGCTTGAACTA 
GTAAAAGTGAAAGCTAACGAACCTCGAGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAGAGTAAACAAGC 
TGAGGCTACAAGGTTAAA^AAAATCAAGACAGATCGTGAAGAAGCAGAAGAAGAAGCTAAACGAAGAGCAGATGCTA 
AAGAGCAAGGTAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTAGCAACACCTGATAAAAAAGAAAAT 
GATGCGAAGTCTTCAGATTCTAGCGTAGGTGAAGAAACTCTTCCAAGCCCATCCCTGAAACCAGAAAAAAAGGTAGC 
AGAAGCTGAGAAGAAGGTTGAAGAAGCTAAGAAAAAAGCCGAGGATCAAAAAGAAGAAGATCGCCGTAACTACCCAA 
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CCAATACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAAAAAAGCGGAGCTTGAACTAGTA 

AAAGAGGAAGCTAAGGAACCTCGAAACGAGGAAAAAGTTAAGCAAGCAAAAGCGGAAGTTGAGAGTAAAAAAGCTGA 

GGCTACAAGGTTAGAAAAAATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAGCTAAACGAAAAGCAGCAGAAGAAG 

ATAAAGTTAAAGAAAAACCAGCTGAACAACCACAACCAGCGCCGGCTCCAAAAGCAGAAAAACCAGCTCCAGCTCCA 

AAACCAGAGAATCCAGCTGAAC7UVCCAAAAGCAGAAAAACCAGCTGATCAACAAGCTGAAGAAGACTATG 

ATCAGAAGAAGAATATAATCGCTTGACTCAACAGCAACCGCCAAAAACTGAAAAACCAGCACAACCATCTACTCCAA 

AAACAGGCTGGAAACAAGAAAACGGTATGTGGTACTTCTACAATACTGATGGTTCAATGGCGACAGGATGGCTCCAA 

AACAATGGCTCATGGTACTACCTCAACAGCAATGGCGCTATGGCGACAGGATGGCTCCAAAACAATGGTTCATGGTA 

CTATCTAAACGCTAATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGTTCATGGTACTACCTAAACGCTAATG 

GTTCAATGGCGACAGGATGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTTCAATGGCGACAGGA 

TGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTGATATGGCGACAGGTTGGGTGAAAGATGGAGA 

TACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATG 

TCAATGGCTCAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATGGAGTCAATGCCAATGGTGAATGGGTAAAC 

TAA 

MFASKSERK^mYSIRKFSVGVASVWASLVMGSVVHATENEGATQVPTSSNRANESQAEQGEQPKKLDSERDK^ 

VEEYVKKIVGESYAKSTKKRHTITVALVNELNNIKNEYU^KIVESTSESQLQILMMESRSKVDEAVSKFEKDSSSSS 

SSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKDQKEEDRRNYPTITYKTLELEIAESDVEVKKAELEL 

VKVKANEPRDEQKIKQAEAEVESKQAEATRLKKIKTDREEAEEEAKRRADAKEQGKPKGRAKRGVPGELATPDKKEN 

DAKSSDSSVGEETLPSPSLKPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEVKICAELELV 

E AKE PRNEE KVKQAKAE VE S KKAE ATRLE KI KTDRKKAE E EAKRKAAE ED KA^KE KP AEQ PQ P AP AP KAE 
KPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTGWKQENGMWYFYNTDGSMATGWLQ 
^TNGSWYYLNSNGAMATGWLQNNGSWYYLNANGSMATGWLQNNGSWYYLNANGS^4ATGWLQYNGSWYYLNANGS^^ 
WLQYNGSWYYLNANGDMATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYWGSGALAWTTVDGYGW;^ 
Z 

ID303 

ATGGTAAAAAGACGTATAAGGAGAGGGACGAGAGAACCTGAAAAAGTTGTTGTTCCTGAGCAATCATCTATTCCTTC 
GTATCCTGTATCTGTTACATCTAACCAAGGAACAGATGTAGCAGTAGAACCAGCTAAAGCAGTTGCTCCAACAACAG 
ACTGGAAACAAGAAAATGGTATGTGGTATTTTTATAATACTGATGGTTCCATGGCAACAGGTTGGGTACAAGTTAAT 
AGTTCATGGTACTACCTCAACAGCAACGGTTCTATGAAAGTCAATCAATGGTTCCAAGTTGGTGGTAAATGGTATTA 
TGTAAATACATCGGGTGAGTTAGCGGTCAATACAAGTATAGATGGCTATAGAGTCAATGATAATGGTGAATGGGTGC 
GTTAA 

MVKRRI RRGTREPEKWVPEQSS I PS YPVS VTSNQGTDVAVEP AKAVAPTTDWKQENGMWYFYNTDGSMATGWVQVN 
SSWYYLNSNGSMKVNQWFQVGGKWYYVNTSGELiAVNTSIDGYRVNDNGEWVRZ 

ID304 

CTGAATACAAGTTTTGTTCATGCTGCTGATGGGATTCAATATGTCAGAGATGATACTAGAGATAAAGAAGAGGGAAT 
AGAGTATGATGACGCTGACAATGGGGATATTATTGTAAAAGTAGCGACTAAACCTAAGGTAGTAACCAAGAAAATTT 
CAAGTACGCGAATTCGTTATGAAAAAGATGAAACAAAAGACCGTAGTGAAAATCCTGTTACAATTGATGGAGAGGAT 
GGCTATGTAACTACGACAAGGACCTACGATGTTAATCCAGAGACTGGTTATGTTACCGAACAGGTTACTGTTGATAG 
AAAAGAAGCCACGGATACAGTTATCAAAGTTCCAGCTAAAAGCAAGGTTGAAGAAGTTCTTGTTCCATTTGCTACTA 
AATATGAAGCAGACAATGACCTTTCTGCAGGACAGGAGCAAGAGATTACTCTAGGAAAGAATGGGAAAACAGTTACA 
ACGATAACTTATAATGTAGATGGAAAGAGTGGACAAGTAACTGAGAGTACTTTAAGTCAAAAAAAAGACTCTCAAAC 
AAGAGTTGTTAAAAAAAGAACCAAGCCCCAAGTTCTTGTCCAAGAAATTCCAATCGAAACAGAATATCTCGATGGCC 
CAACTCTTGATAAAAGTCAAGAAGTAGAAGAAGTAGGAGAAATTGGTAAATTACTCTTACTACAATCTATACTGTAG 

LNTS FVHAADGI QYVRDDTRDKEEG I E YDDADNGD 1 1 VKVATKPKWTKKI SSTRI RYEKDETKDRS ENPVT I DGED 
GYVTTTRTYDWPETGYVTEQVTVDRKEATDWIKVPAKSKVEEVLVPFATKYEADNDLSAGQEQEITLGKNGKTVT 
TITYNVDGKSGQVTESTLSQKKDSQTRWKKRTKPQVLVQEIPIETEYLDGPTLDKSQEVEEVGEIGKLLLLQSILZ 



ID305 

ATGAAGCTTTTGAAAAAAATGATGCAAATCGCACTAGCCACATTTTTCTTCGGTTTGTTAGCGACAAATACAGTATT 
TGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAATGGTAGAACCTACTACAAAAAGGGGGATCTAAAAGAAA 
CCTACTGGAGAGTGATAGATGGGAAGTACTATTATTTTGATCCTTTATCCGGAGAGATGGTTGTCGGCTGGCAATAT 
ATACCTGCTCCACACAAGGGGGTTACGATTGGTC CTTCTCCAAG AAT AGAGAT T GC T C T T AGAC C AG ATTGGTTTTA 
TTTTGGTCAAGATGGTGTATTACAAGAATTTGTTGGCAAGCAAGTTTTAGAAGCAAAAACTGCTACGAATACCAACA 
AACATCATGGGGAAGAATATGATAGCCAAGCAGAGAAACGAGTCTATTATTTTGAAGATCAGCGTAGTTATCATACT 
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TTAAAAACTGGTTGGATTTATGAAGAGGGTCATTGGTATTATTTACAGAAGGATGGTGGCTTTGATTCGCGCATCAA 
CAGATTGACGGTTGGAGAGCTAGCACGTGGTTGGGTTAAGGATTACCCTCTTACGTATGATGAAGAGAAGCTAAAAG 
CAGCTCCATGGTACTATCTAAATCCAGCAACTGGCATTATGCAAACAGGTTGGCAATATCTAGGTAATAGATGGTAC 
TACCTCCATTCGTCAGGAGCTATGGCAACTGGCTGGTATAAGGAAGGCTCAACTTGGTACTATCTAGATGCTGAAAA 
5 TGGTGATATGAGAACTGGCTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTG 
GTTGGTATCAGGAAAGTTCGACTTGGTACTATCTAAATGCAAGTAATGGAGATATGAAAACAGGCTGGTTCCAAGTC 
AATGGTAACTGGTACTATGCCTATGATTCAGGTGCTTTAGCTGTTAATACCACAGTAGGTGGTTACTACTTAAACTA 
TAATGGTGAATGGGTTAAGTAA 

10 MKLLKKl^QIAIiATFFFGLI^TNTVFADDSEGWQFVQENGRTYYKKGDLKETYWRVIDGKYYYFDPLSGEMWGW 

IPAPHKGVTIGPSPRIEIALRPDWFYFGQDGVLQEFVGKQVLEAKTATNTNKHHGEEYDSQAEKRVYYFEDQRSYHT 
LKTGWIYEEGHWYYLQKI>GGFDSRINRLTVGEIiARGWKDYPLTYDEEKLKAAPWYYLNPATGIMQTGWQYLGNRWY 
YLHSSGAMATGWYKEGSTWYYLDAENGDMRTGWQNLGNKWYYLRSSGAMATGWYQESSTWYYLNASNGDMKTGWFQV 
NGNWYYAYDSGALAVNTTVGGYYIiNYNGEWVKZ 

15 

ID306 

TTGGCTGGTAGATATGGTTCTGCTGTTCAGTGTACAGAAGTGACTGCCTCAAACCTTTCAACAGTTAAAACTAAAGC 
TACGGTTGTAGAAAAACCACTGAAAGATTTTAGAGCGTCTACGTCTGATCAGTCTGGTTGGGTGGAATCTAATGGTA 
AATGGTATTTCTATGAGTCTGGTGATGTGAAGACAGGTTGGGTGAAAACAGATGGTAAATGGTACTATTTGAATGAC 
20 TTAGGTGTCATGCAGACTGGATTTGTAAAATTTTCTGGTAGCTGGTATTACTTGAGCAATTCAGGTGCTATGTTTAC 
AGGCTGGGGAACAGATGGTAGCAGATGGTTCTACTTTGACGGCTCAGGAGCTATGAAGACAGGCTGGTACAAGGAAA 
ATGGCACTTGGTATTACCTTGACGAAGCAGGTATCATGAAGACAGGTTGGTTTAAAGTCGGACCACACTGGTACTAT 
GCCTACGGTTCAGGAGCTTTGGCTGTGAGCACAACAACACCAGATGGTTACCGTGTAAATGGTAATGGTGAATGGGT 

AAACTAG 

25 

I^GRYGSAVQCTEVTASNLSTVKTKATWEKPLKDFRASTSDQSGWVESNGKWYFYESGDVKTGWVKTDGKWYYLND 
LGVMQTGFVKFSGSWYYLSNSGAMFTGWGTDGSRWFYFDGSGAMKTGWYKENGTWYYLDEAGIMKTGWFKVGPHWYY 
AYGSGALAVSTTTPDGYRVNGNGEWVNZ 

30 ID307 

ATGAAAATTTTGAAAAAAACTATGCAAGTTGGACTGACAGTATTTTTCTTTGGTTTGCTAGGGACCAGTACAGTATT 
TGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAACGGAAGAACCTACTACAAAAAGGGGGACCTCAAAGAAA 
CCTACTGGCGAGTGATTGATGGTAAGTACTATTATTTTGATTCTCTATCTGGAGAGATGGTTGTCGGCTGGCAATAT 
ATCCCGTTTCCATCTAAAGGTAGTACAATTGGTCCTTACCCAAATGGTATCAGATTAGAAGGTTTTCCAAAGTCAGA 

35 GTGGTACTACTTCGATAAAAATGGAGTGCTACAAGAGTTTGTTGGTTGGAAAACATTAGAGATTAAAACTAAAGACA 
GTGTTGGAAGAAAGTACGGGGAAAAACGTGAAGATTCAGAAGATAAAGAAGAGAAGCGTTATTATACGAACTATTAC 
TTTAATCAAAATCATTCTTTAGAGACAGGTTGGCTTTATGATCAGTCTAACTGGTATTATCTAGCTAAGACGGAAAT 
Ti\ATGGAGAAAACTACCTTGGTGGTGAAAGACGTGCGGGGTGGATAAACGATGATTCGACTTGGTACTACCTAGATC 
CAACAACTGGTATTATGCAAACAGGTTGGCAATATCTAGGTAATAAGTGGTACTACCTCCGTTCCTCAGGAGCAATG 

40 GCCACTGGCTGGTATCAGGAAGGTACCACTTGGTATTATTTAGACCACCCAAATGGCGATATGAAAACAGGTTGGCA 
AAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTGGTTGGTATCAAGATGGTTCAACTT 
GGTACTACCTAAATGCAGGTAATGGAGACATGAAGACAGGTTGGTTCCAGGTCAATGGCAACTGGTACTATGCTTAT 
AGCTCAGGTGCTTTGGCAGTGAATACGACCGTAGATGGCTATTCTGTCAACTATAATGGCGAATGGGTTCGGTAA 

45 MKILKKTMQVGLWFFFGLLGTSTVFADDSEGWQFVQENGRTYYKKGDLKJETYWRVIDGKYYYFDSLSGEMVVGWQY 
IPFPSKGSTIGPYPNGIRLEGFPKSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGEKREDSEDKEEKRYYTNYY 
FNQNHSLETGWLYDQS^^A^YY]^KTEINGENYLGGERRAGWI^^DDSTWYYLDPTTGIMQTGWQYLGNKWYYLRSSGAM 
ATGWYQEGTTWYYLDHPNGDMKTGWQNLGNKWYYLRSSGAMATGWYQDGSTWYYLNAGNGDMKTGWFQVNGNWYYAY 
SSGALAVNTTVDGYSVNYNGEWVRZ 

50 



ID308 

ATGAAAAAGAAATTAACTAGTTTAGCACTTGTAGGCGCTTTTTTAGGTTTGTCATGGTATGGGAATGTTCAGGCTCA 
AGAAAGTTCAGGAAATAAAATCCACTTTATCAATGTTCAAGAAGGTGGCAGTGATGCGATTATTCTTGAAAGCAATG 

55 GACATTTTGCCATGGTGGATACAGGAGAAGATTATGATTTCCCAGATGGAAGTGATTCTCGCTATCCATGGAGAGAA 
GGAATTGAAACGTCTTATAAGCATGTTCTAACAGACCGTGTCTTTCGTCGTTTGAAGGAATTGGGTGTCCAAAAACT 
TGATTTTATTTTGGTGACCCATACCCACAGTGATCATATTGGAAATGTTGATGAATTACTGTCTACCTATCCAGTTG 
ACCGAGTCTATCTTAAGAAATATAGTGATAGTCGTATTACTAATTCTGAACGTCTATGGGATAATCTGTATGGCTAT 
GATAAGGTTTTACAGACTGCTGCAGAAAAAGGTGTTTCAGTTATTCAAAATATCACACAAGGGGATGCTCATTTTCA 

60 GTTTGGGGACATGGATATTCAGCTCTATAATTATGAAAATGAAACTGATTCATCGGGTGAATTAAAGAAAATTTGGG 
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ATGACAATTCCAATTCCTTGATTAGCGTGGTGAAAGTCAATGGCAAGAAAATTTACCTTGGGGGCGATTTAGATAAT 
GTTCATGGAGCAGAAGACAAGTATGGTCCTCTCATTGGAAAAGTTGATTTGATGAAGTTTAATCATCACCATGATAC 
CAACAAATCAAATACCAAGGATTTCATTAAAAATTTGAGTCCGAGTTTGATTGTTCAAACTTCGGATAGTCTACCTT 
GGAAAAATGGTGTTGATAGTGAGTATGTTAATTGGCTCAAAGAACGAGGAATTGAGAGAATCAACGCAGCCAGCAAA 
GACTATGATGCAACAGTTTTTGATATTCGAAAAGACGGTTTTGTCAATATTTCAACATCCTACAAGCCGATTCCAAG 
TTTTCAAGCTGGTTGGCATAAGAGTGCATATGGGAACTGGTGGTATCAAGCGCCTGATTCTACAGGAGAGTATGCTG 
TCGGTTGGAATGAAATCGAAGGTGAATGGTATTACTTTAACCAAACGGGTATCTTGTTACAGAATCAATGGAAAAAA 
TGGAACAATCATTGGTTCTATTTGACAGACTCTGGTGCTTCTGCTAAAAATTGGAAGAAAATCGCTGGAATCTGGTA 
TTATTTTAACAAAGAAAACCAGATGGAAATTGGTTGGATTCAAGATAAAGAGCAGTGGTATTATTTGGATGTTGATG 
GTTCTATGAAGACAGGATGGCTTCAATATATGGGGCAATGGTATTACTTTGCTCCATCAGGGGAAATGAAAATGGGC 
TGGGTAAAAGATAAAGAAACCTGGTACTATATGGATTCTACTGGTGTCATGAAGACAGGTGAGATAGAAGTTGCTGG 
TCAACATTATTATCTGGAAGATTCAGGAGCTATGAAGCAAGGCTGGCATAAAAAGGCAAATGATTGGTATTTCTACA 
AGACAGACGGTTCACGAGCTGTGGGTTGGATCAAGGACAAGGATAAATGGTACTTCTTGAAAGAAAATGGTCAATTA 
CTTGTGAACGGTAAGACACCAGAAGGTTATACTGTGGATTCAAGTGGTGCCTGGTTAGTGGATGTTTCGATCGAGAA 
ATCTGCTACAATTAAAACTACAAGTCATTCAGAAATAAAAGAATCCAAAGAAGTAGTGAAAAAGGATCTTGAAAATA 
AAGAAACGAGTCAACATGAAAGTGTTACAAATTTTTCAACTAGTCAAGATTTGACATCCTCAACTTCACAAAGCTCT 
GAAACGAGTGTAAACAAATCGGAATCAGAACAGTAG 

MKKKLTSLALVGAFLGLSWYGNVQAQESSGNKIHFINVQEGGSDAIILESNGHFAMVDTGEDYDFPDGSDSRYPWRE 

GIETSYKHVLTDRVFRRIiKELGVQKLDFILVTHTHSDHIGNVDELLSTYPVDRVYLKKYSDSRITNSERLWDNLYGY 

DKVLQTAAEKGVSVIQNITQGDAHFQFGDMDIQLYNYENETDSSGELKKIWDDNSNSLISWKVNGKKIYLGGDLDN 

VHGAEDKYGPLIGKVDLMKFNHHHDTNKSNTKDFIKNLSPSLIVQTSDSLPWKNGVDSEYVNWLKERGIERIN/^ 

DYDATVFDIRKDGFVNISTSYKPIPSFQAGWHKSAYGNWWYQAPDSTGEYAVGWNEIEGEWYYFNQTGILLQNQWKK 

WNNHWFYLTDSGASAKlSrWKKIAGIWYYFNKENQMEIGWIQDKEQWYYLDVDGSMKTGWLQYMGQWYYFAP 

WVKDKETWYYMDSTGVMKTGEIEVAGQHYYLEDSGAMKQGWHKKANDWYFYKTDGSRAVGWIKDKDK^YFLK^ 

LVNGKTPEGYTVDSSGAWLVDVSIEKSATIKTTSHSEIKESKEWKKDLENKETSQHESVTNFSTSQDLTSSTSQSS 

ETSVNKSESEQZ 

ID309 

ATGGAAATTAATGTGAGTAAATTAAGAACAGATTTGCCTCAAGTCGGCGTGCAACCATATAGGCAAGTACACGCACA 
CTCAACTGGGAATCCGCATTCAACCGTACAGAATGAAGCGGATTATCACTGGCGGAAAGACCCAGAATTAGGTTTTT 
TCTCGCACATTGTTGGGAACGGTTGCATCATGCAGGTAGGACCTGTTGATAATGGTGCCTGGGACGTTGGGGGCGGT 
TGGAATGCTGAGACCTATGCAGCGGTTGAACTGATTGAAAGCCATTCAACCAAAGAAGAGTTCATGACGGACTACCG 
CCTTTATATCGAACTCTTACGCAATCTAGCAGATGAAGCAGGTTTGCCGAAAACGCTTGATACAGGGAGTTTAGCTG 
GAATTAAAACGCACGAGTATTGCACGAATAACCAACCAAACAACCACTCAGACCACGTTGACCCTTATCCATATCTT 
GCTAAATGGGGCATTAGCCGTGAGCAGTTTAAGCATGATATTGAGAACGGCTTGACGATTGAAACAGGCTGGCAGAA 
GAATGACACTGGCTACTGGTACGTACATTCAGACGGCTCTTATCCAAAAGACAAGTTTGAGAAAATCAATGGCACTT 
GGTACTACTTTGACAGTTCAGGCTATATGCTTGCAGACCGCTGGAGGAAGCACACAGACGGCAACTGGTACTGGTTC 
GACAACTCAGGCGAAATGGCTACAGGCTGGAAGAAAATCGCTGATAAGTGGTACTATTTCAACGAAGAAGGTGCCAT 
GAAGACAGGCTGGGTCAAGTACAAGGACACTTGGTACTACTTAGACGCTAAAGAAGGCGCCATGGTATCAAATGCCT 
TTATCCAGTCAGCGGACGGAACAGGCTGGTACTACCTCAAACCAGACGGAACACTGGCAGACAAGCCAGAATTCACA 
GTAGAGCCAGATGGCTTGATTACAGTAAAATAA 

MEINVSKLRTDLPQVGVQPYRQVHAHSTGNPHSTVQNEADYHWRKDPELGFFSHIVGNGCIMQVGPVDNGAWDVGGG 
WNAETYAAVELIESHSTKEEFMTDYRLYIELLRNLADEAGLPKTLDTGSLiAGIKTHEYCTNNQPNNHSDHVDPYPYL 
AKWGISREQFKHDIENGLTIETGWQKNDTGYWYVHSDGSYPKDKFEKINGTWYYFDSSGYMLADRWRKHTDGNWYWF 
DNSGEMATGWKKIADKWYYFNEEGAMKTGWVKYKDTWYYLDAKEGAMVSNAFIQSADGTGWYYLKPDGTLADKPEFT 
VEPDGLITVKZ 

ID310 

ATGGGCACAACAGGATTTACAATAATTGACTTAATTATCTTGATTGTTTATTTACTTGCGGTGTTGGTTGCAGGTAT 
CTATTTCTCTAAAAAAGAGATGAAAGGAAAAGAGTTCTTTAAAGGAGATGGTTCGGTTCCTTGGTATGTTACTTCGG 
TATCCATTTTTGCCACAATGCTCAGTCCGATTTCCTTCTTGGGACTCGCTGGTAGCTCTTATGCAGGTAGCTGGATT 
TTATGGTTTGCTCAATTAGGGATGGTAGTAGCTATTCCACTGACAATTCGTTTTATCTTACCTATCTTTGCACGGAT 
AGACATCGATACGGCATATGATTACTTGGATAAACGTTTTAATTCTAAAGCACTTCGTATTATTTCAGCACTCTTGT 
TTATTATTTATCAATTGGGACGTATGTCTATCATTATGTACCTCCCATCAGCTGGTTTATCAGTATTGACAGGAATT 
GACATCAATATTTTGATTATTTTGATGGGTGTAGTTGCAATTGTTTATTCTTATACTGGTGGTCTAAAATCCGTATT 
ATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTTTAGCTTTATTTGTACTGATTGCTAATATTA 
AAGGTGGCTTTGGTGCAGTAGCAGAAACATTAGCAAACGGGAAATTCCTTGCTGCAAATGAAAAACTTTTCGATCCT 
AACTTGCTTTCAAACTCCATCTTTTTAATTGTGATGGGTTCAGGCTTTACAATCTTGTCTTCCTATGCTTCATCTCA 
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AGATTTGGTTCAACGTTTTACTACAACACAAAATATTAAGAAACTTAATAAGATGTTGTTCACAAACGGTGTTTTGT 
CACTTGCAACTGCAACAGTCTTTTACTTGATTGGTACAGGCTTGTACGTATTCTATCAAGTACAAAATGCAGATAGT 
GCAGCTAGCAATATCCCTCAAGACCAAATCTTTATGTACTTTATTGCATACCAGTTACCAGTAGGTATCACAGGTTT 
GATCTTGGCAGCGATTTATGCAGCATCTCAATCAACTATTTCAACAGGTTTGAACTCTGTTGCAACTTCATGGACAT 
5 TGGATATTCAAGATGTCATTTCTAAAAATATGTCAGACAATCGTCGTACGAAAATTGCACAATTCGTATCTCTAGCA 
GTAGGTTTATTCTCAATTGGTGTTTCCATTGTCATGGCTCACTCAGATATTAAATCTGCATACGAATGGTTCAATAG 
TTTCATGGGACTTGTACTTGGTCTACTTGGTGGTGTATTTATTCTTGGATTTGTTTCTAAAAAAGCAAATAAACAAG 
GTGCTTATGCAGCGCTGATTGTATCAACCATCGTCATGGTATTTATTAAATACTTCCTTCCTCCAACAGCTGTTAGC 
TACTGGGCATATTCATTGATTTCAATCTCTGTATCAGTAGTTTCAGGTTATATTGTATCTGTTCTTACTGGAAATAA 
10 AGTATCTGCACCTAAATATACAACGATTCATGATATTACAGAAATTAAAGCGGATTCAAGTTGGGAAGTTCGTCACT 

AA 

MGTTGFTIIDLIILIVYLLAVLVAGIYFSKKEMKGKEFFKGDGSVPWYVTSVSIFATMLSPISFLGIiAGSSYAGSWI 
LWFAQLGMVVAIPLTIRFILPIFARIDIDTAYDYLDKRFNSKALRIISALLFIIYQLGRMSIIMYLPSAGLSVLTGI 
15 DINILIILMGWAIVYSYTGGLK5VLWTDFIQGVILISGWLALFVLIANIKGGFGAVAETLANGKFLAANEKLFDP 
NLLSNS I FLI VMGSGFTILSSYASSQDLVQRFTTTQNI KKLNKMLFTNGVLSI^TATVFYLIGTGLYVFYQVQNADS 
AASNIPQDQIFMYFIAYQLPVGITGLIIJ\AIYAASQSTISTGLNSVATSWTLDIQDVISKNMSDNRRTKIAQFVSLA 
VGLFSIGVSIVMAHSDIKSAYEWFNSFMGLVLGLLGGVFILGFVSKKANKQGAYAALIVSTIVMVFIKYFLPPTAVS 
YWAYSLISISVSWSGYIVSVLTGNKVSAPKYTTIHDITEIKADSSWEVRHZ 

20 

ID311 

ATGAAAATTAATAAAAAATATCTAGCAGGTTCAGTGGCAGTCCTTGCCCTAAGTGTTTGTTCCTATGAGCTTGGTCG 
TCACCAAGCTGGTCAGGATAAGAAAGAGTCTAATCGAGTTGCTTATATAGATGGTGATCAGGCTGGTCAAAAGGCAG 
AAAACTTGACACCAGATGAAGTCAGTAAGAGGGAGGGGATCAACGCCGAACAAATCGTCATCAAGATTACGGATCAA 

25 GGTTATGTGACCTCTCATGGAGACCATTATCATTACTATAATGGCAAGGTCCCTTATGATGCCATCATCAGTGAAGA 
GCTCCTCATGAAAGATCCGAATTATCAGTTGAAGGATTCAGACATTGTCAATGAAATCAAGGGTGGTTATGTCATCA 
AGGTAGACGGAAAATACTATGTTTACCTTAAGGATGCAGCTCATGCGGATAATATTCGGACAAAAGAAGAGATTAAA 
CGTCAGAAGCAGGAACGCAGTCATAATCACGGGTCAGGAGCTAACGATCATGCAGTAGCTGCAGCCAGAGCCCAAGG 
ACGCTATACAACGGATGATGGGTATATCTTCAATGCATCTGATATCATTGAGGACACGGGTGATGCTTATATCGTTC 

30 CTCACGGCGACCATTACCATTACATTCCTAAGAATGAGTTATCAGCTAGCGAGTTAGCTGCTGCAGAAGCCTATTGG 
AATGGGAAGCAGGGATCTCGTCCTTCTTCAAGTTCTAGTTATAATGCAAATCCAGCTCAACCAAGATTGTCAGAGAA 
CCACAATCTGACTGTCACTCCAACTTATCATCAAAATCAAGGGGAAAACATTTCAAGCCTTTTACGTGAATTGTATG 
CTAAACCCTTATCAGAACGCCATGTGGAATCTGATGGCCTTATTTTCGACCCAGCGCAAATCACAAGTCGAACCGCC 
AGAGGTGTAGCTGTCCCTCATGGTAACCATTACCACTTTATCCCTTATGAACAAATGTCTGAATTGGAAAAACGAAT 

35 TGCTCGTATTATTCCCCTTCGTTATCGTTCAAACCATTGGGTACCAGATTCAAGACCAGAACAACCT^GTCCACAAT 
CGACTCCGGAACCTAGTCCAAGTCCGCAACCTGCACCAAATCCTCAACCAGCTCCAAGCAATCCAATTGATGAGAAA 
TTGGTCAAAGAAGCTGTTCGAAAAGTAGGCGATGGTTATGTCTTTGAGGAGAATGGAGTTTCTCGTTATATCCCAGC 
CAAGGATCTTTCAGCAGAAACAGCAGCAGGCATTGATAGCAAACTGGCCAAGCAGGAAAGTTTATCTCATAAGCTAG 
GAGCTAAGAAAACTGACCTCCCATCTAGTGATCGAGAATTTTACAATAAGGCTTATGACTTACTAGCAAGAATTCAC 

40 CAAGATTTACTTGATAATAAAGGTCGACAAGTTGATTTTGAGGCTTTGGATAACCTGTTGGAACGACTCAAGGATGT 
CCCAAGTGATAAAGTCAAGTTAGTGGATGATATTCTTGCCTTCTTAGCTCCGATTCGTCATCCAGAACGTTTAGGAA 
AACCAAATGCGCAAATTACCTACACTGATGATGAGATTCAAGTAGCCAAGTTGGCAGGCAAGTACACAACAGAAGAC 
GGTTATATCTTTGATCCTCGTGATATAACCAGTGATGAGGGGGATGCCTATGTAACTCCACATATGACCCATAGCCA 
CTGGATTAAAAAAGATAGTTTGTCTGAAGCTGAGAGAGCGGCAGCCCAGGCTTATGCTAAAGAGAAAGGTTTGACCC 

45 CTCCTTCGACAGACCATCAGGATTCAGGAAATACTGAGGCAAAAGGAGCAGAAGCTATCTACAACCGCGTGAAAGCA 
GCTAAGAAGGTGCCACTTGATCGTATGCCTTACAATCTTCAATATACTGTAGAAGTCAAAAACGGTAGTTTAATCAT 
ACCTCATTATGACCATTACCATAACATCAAATTTGAGTGGTTTGACGAAGGCCTTTATGAGGCACCTAAGGGGTATA 
CTCTTGAGGATCTTTTGGCGACTGTCAAGTACTATGTCGAACATCCAAACGAACGTCCGCATTCAGATAATGGTTTT 
GGTAACGCTAGCGACCATGTTCAAAGAAACAAAAATGGTCAAGCTGATACCAATCAAACGGAAAAACCAAGCGAGGA 

50 GAAACCTCAGACAGAAAAACCTGAGGAAGAAACCCCTCGAGAAGAGAAACCGCAAAGCGAGAAACCAGAGTCTCCAA 
AACCAACAGAGGAACCAGAAGAATCACCAGAGGAATCAGAAGAACCTCAGGTCGAGACTGAAAAGGTTGAAGAAAAA 
CTGAGAGAGGCTGAAGATTTACTTGGAAAAATCCAGGATCCAATTATCAAGTCCAATGCCAAAGAGACTCTCACAGG 
ATTAAAAAATAATTTACTATTTGGCACCCAGGACAACAATACTATTATGGCAGAAGCTGAAAAACTATTGGCTTTAT 
TAAAGGAGAGTAAGTAA 

55 

MKINKKYLAGSVAVLALSVCSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQ 
GYVTSHGDHYHYYNGKVPYDAI I SEELLMKDPNYQLKDSD I VNEI KGGYVI KVDGKYYVYLKDAAHADNI RTKEE I K 
RQKQERSHNHGSGANDHAVAAARAQGRYTTDDGYIFNASDXIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAYW 
NGKQGSRPSSSSSYNANPAQPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTA 
60 RGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPIDEK 
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LVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSKLAKQESLSHKLGAKKTDLPSSDREFYNKAYDLLARIH 
QDLLDNKGRQVDFEALDNLLERLKDVPSDKVKLVDDILAFLAPIRHPERLGKPNAQITYTDDEIQVAKIiAGKYTTE 
GYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAKGAEAIYNRVKA 
AKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFEWFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGF 
5 GNASDHVQRNKNGQADTNQTEKPSEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEESPEESEEPQVETEKVEEK 
LREAEDLLGKIQDPIIKSNAKETLTGLKNNLLFGTQDNNTIMAEAEKLLALLKESKZ 

ID312 

ATGGAGGGATTGGTTAGAGTGCATTTATTGCCTGTATTTGGCGATTACAAGCTATCTAAACTTACTACGCCTATTCT 
10 TCAACAGCAAGTAAACAAATGGGCTGACAAGGCA/y^TAAAGGCGAAAAAGGGGCATTTGCTAACTACTCTTTGCTCC 
ATAACATGAATAAGCGTATTTTGAAATATGGCGTAGCTATCCAGGTAATACAATACAACCCAGCTAATGATGTCATC 
GTTCCACGCAAACAGCAAAAAGAAAAGGCTGCTGTCAAATACTTAGACAACAAAGAATTAAAACAGTTTCTTGATTA 
TTTAGATGCTCTGGATCAATCAAATTATGAGAACTTATTTGATGTTGTTCTGTATAAGACTTTATTGGCCACTGGTT 
GCCGTATTAGTGAGGCTCTGGCTCTTGAATGGTCTGATATTGACCTAGAAAGCGGTGTTATCAGCATCAATAAGACA 
15 CTAAACCGCTATCAGGAAATAAACTCACCTAAATCAAGCGCTGGTTATCGTGATATACCAATAGACAAAGCCACATT 
ACTTTTACTGAAACAATACAAAAACCGTCAACAAATTCAGTCTTGGAAATTAGGCCGATCTGAAACAGTTGTATTCT 
CTGTATTTACGGAGAAATATGCTTATGCTTGTAACTTACGCAAACGCCTAAATAAGCATTTTGATGCTGCTGGAGTA 
ACTAACGTATCATTTCATGGTTTCCGCCATACACATACTACTATGATGCTCTATGCTCAGGTTAGCCCGAAAGATGT 
TCAGTATAGATTAGGCCACTCTAATTTAATGATCACTGAAAATACTTACTGGCATACTAACCAAGAGAATGCAAAAA 
20 AAGCCGTCTCAAATTATGAAACAGCTATCAACAATTTATAA 

MEGLVRVHLLPVFGDYKLS KLTTP I LQQQVNKWADKANKGE KGAFANYS LLHNMNKRI LKYGVAI QVIQ YNPANDVI 
VPRKQQKSKAAVKYLDNKELKQFLDYLDALDQSNYENLFDWLYKTLIxATGCRISEAIJVLEWSDIDLESGVISINKT 
LNRYQEINSPKSSAGYRDIPIDKATLLLLKQYKNRQQIQSWKl.GRSETWFSVFTEKYAYACNLRKilLNKHFDAAGV 
25 TNVSFHGFRHTHTTMMLYAQVSPKDVQYRLGHSNLMITENTYWHTNQENAKKAVSNYETAINNLZ 
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CLAIMS: 

1 . A Streptococcus pneumoniae protein or polypeptide having a sequence 
selected from those shown in table 2. 

2. A Streptococcus pneumoniae protein or polypeptide having a sequence 
selected from those shown in table 4. 

3 . A protein or polypeptide as claimed in claim 1 or claim 2 provided in 
substantially pure form. 

4. A protein or polypeptide which is substantially identical to one defined in any 
one of claims 1 to 3 . 

5. A homologue or derivative of a protein or polypeptide as defined in any one of 
claims 1 to 4. 

6. An antigenic and/or immunogenic fragment of a protein or polypeptide as 
defined 

in Tables 2-4. 

7. A nucleic acid molecule comprising or consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 1 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 
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(iv) a sequence which is substantially identical with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 1 . 

8. A nucleic acid molecule comprising or consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which is substantially identical with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 4. 

9. The use of a protein or polypeptide having a sequence selected from those 
shown in Tables 2-4, or homologues, derivatives and/or fragments thereof, as an 
immunogen and/or antigen. 

10. An immunogenic and/or antigenic composition comprising one or more 
proteins or polypeptides selected from those whose sequences are shown in Tables 2- 
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4, or homologues or derivatives thereof, and/or fragments of any of these. 

11. An immunogenic and/or antigenic composition as clauned in claim 10 which 
is a vaccine or is for use in a diagnostic assay. 

5 

12. A vaccine as claimed in claim 11 which comprises one or more additional 
components selected from excipients, diluents, adjuvants or the like. 

13. A vaccine composition comprising one or more nucleic acid sequences as 
10 defined in Tables 1, 3 or 4. 

14. A method for the detection/diagnosis of S. pneumoniae which comprises the 
step of bringing into contact a sample to be tested with at least one protein or 
polypeptide as defined in Tables 2-4, or homologue, derivative or fragment thereof. 

15 

15. An antibody capable of binding to a protein or polypeptide as defined in 
Tables 2-4, or for a homologue, derivative or fragment thereof. 

16. An antibody as defined in claim 15 which is a monoclonal antibody. 

20 

17. A method for the detection/diagnosis of S. pneumoniae which comprises the step 
of bringing into contact a sample to be tested and at least one antibody as define din 
claim 15 or claim 16. 

25 18. A method for the detection/diagnosis of S.pneumoniae which comprises the 
step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as defined in claim 7 or claim 8. 
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19. A method of determining whether a protein or polypeptide as defined in 
Tables 2-4 represents a potential anti-microbial target which comprises inactivating 
said protein or polypeptide and determining whether S.pneumoniae is still viable in 
vitro or in vivo, 

20. The use of an agent capable of antagonising, inhibiting or otherwise 
interfering with the function or expression of a protein or polypeptide as defined in 
Tables 2-4 in the manufacture of a medicament for use in the treatment or 
prophylaxis of S.pneumoniae infection 
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inhibiting or otherwise interfering with the function or expression of a 
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said protein or polypeptide, and its use as a vaccine; 
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molecule; method of determining whether said protein or 
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as invention 1 but limited to subject-matter relating SEQ ID 
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